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FIELD or INVENTION 

The present invention relates to the field of 
molecular biology and methods for the isolation and 
expression of foreign genes in recombinant 

10 microorganisms. More specifically, the invention 

relates to the isolation, sequencing, and recombinant 
expression of nucleic acid fragments (genes) encoding 
a stereospecific, nitrile hydratase (NHase) activity 
capable of catalyzing the hydrolysis of certain 

15 racemic nitriles to the corresponding R- or S- amides. 
Additionally, the invention relates to the co- 
expression of the nitrile hydratase nucleic acid 
fragment with a nucleic acid fragment encoding a 
stereospecif ic amidase activity capable of converting 

20 a racemic mixture of R- and S- amides to the 

corresponding enantiomeric R— or S- carboxylic acids. 

BACKnnonNn 

Many agrochemicals and pharmaceuticals of the 
general formula X-CHR-COOH are currently marketed as 
25 racemic or diastereomer mixtures. In many cases the 
physiological effect derives from only one 
enant iomer/diastereomer where the other 
enantiomer/diastereomer is inactive or even harmful. 
Methods for synthesizing enantiomers are becoming 
increasingly important tools for the production of 
chemicals of enantiomer purity. To date, however, no 
recombinant, stereospecif ic NHase has been described 
capable of catalyzing the hydrolysis of certain 
racemic nitriles to the corresponding R- or S- amides. 

Methods for the selective preparation of stereo- 
specific amides from nitriles are known and 
incorporate microorganisms possessing nitrile 
hydratase activity (NHase) . These NHases catalyze the 
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addition of one molecule of water to the nitrile, 
resulting in the formation of the amide free product 
according to Reaction 1: 

Reaction 1 r-cn + h 2 o => rconh 2 

Similarly, methods for the stereospecif ic 
5 production of carboxylic acids are known and 

incorporate microorganisms possessing an amidase (Am) 
activity. In general amidases convert the amide 
product of Reaction 1 to the acid free product plus 
ammonia according to Reaction 2 : 

Reaction 2 rconh 2 => rcooh + nh 3 

10 A wide variety of bacterial genera are known to 

possess a diverse spectrum of nitrile hydratase and 
amidase activities including Rhodococcus f Pseudomonas, 
Alcaligenes , Arthrohacter, Bacillus, Bacterldium, 
Brevibacterium, CoryneJbacterlum, and Micrococcus . For 

15 example, nitrile hydratase enzymes have been isolated 
from Pseudomonas chlororaphls , B23 [Nishiyama, M. J., 
Bacterid., 173:2465-2472 (1991)] Rhodococcus 
rhodochrous Jl [Kobayashi, M., Biochem. Biophys . Acta, 
1129:23-33 (1991)] Brevibacterium sp. 312 (Mayaux 

20 et al., J. Bacterid., 172 :67 64-6773 (1990)), and 
Rhodococcus sp. N-774 [Ikehata, O., Nishiyama, M., 
Horinouchiv S., Beppu, T., Eur. J. Biochem., 181: 
563-570(1989)]. No disclosure of any stereoselective 
activity is made for any of these enzymes. Only two 

25 disclosures have been made for stereoselective nitrile 
hydratase activity in native bacterial strains. The 
Applicants have disclosed a stereospecif ic nitrile 
hydratase from P. putida NRRL-18668 [WO 92/05275 
(1990)]. 

30 Wildtype microorganisms known to possess nitrile 

hydratase activity have been used to convert nitriles 
to amides and carboxylic acids. For example, 
EPA 326,482 discloses the stereospecif ic preparation 
of aryl-2-alkanoic acids such as 2- (4-chlorophenyl) -3- 

2 
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methylbutyric acid by microbial hydrolysis of the 
corresponding racemic amide using members of 
Brevibacterium and Corynebacterium . Similarly, U.S. 
Pat. No. 4,366, 250 teaches the use of Bacillus, 
5 Bacterldium, Micrococcus and Brevijbacteriu/n in a 

method for the preparation of L-amino acids from the 
corresponding racemic amino nitriles. Finally, 
WO 92/05275 teaches a biologically-catalyzed method 
for converting a racemic alkyl nitrile to the 
10 corresponding R- or S-alkanoic acid through an 

intermediate amide using members of the bacterial 
genera Pseudomonas spp. (e.g., putida, aureofaciens, 
Moraxella spp.) and Serratia (e.g., Serratia 
liquefaciens) . 

15 In addition to the use of wildtype organisms, 

recombinant organisms containing heterologous genes 
for the expression of nitrile hydratase are also known 
for the conversion of nitriles. For example, 
Cerebelaud et al., (WO 9504828) teach the isolation 

20 and expression in E. coli of nitrile hydratase genes* 
isolated from C. testosteroni . The transformed hosts 
effectively convert nitriles to amides where the 
nitrile substrate consists of one nitrile and one 
carboxylate group. However, WO 9504828 does not teach 

25 a stereospecif ic conversion of nitriles. 

Similarly, Beppu et al., (EP 5024576) disclose 
plasmids carrying both nitrile hydratase and amidase 
genes from Rhodococcus capable of transforming JET. coli 
where the transformed host is then able to use 

30 isobutyronitrile and isobuty roamide as enzymatic 
substrates. However, EP 5024576 does not teach a 
stereospecif ic conversion of nitriles or amides. 

As with nitrile hydratases, microorganisms 
possessing amidase activity have been used to convert 

35 amides to carboxylic acids. In USSN 08/403911, 
Applicants disclose a method for converting an 
(S) -amide, or stereospecif ically converting a mixture 
of (R)- and (S) -amides to the corresponding 

3 
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enantiomeric (S) -carboxylic acid by contacting said 
amide with Pseudomonas chlororaphis B23 in a solvent. 
This method uses a wildtype microorganism and does not 
anticipate a recombinant catalyst or heterologous gene 
5 expression. Blakeyetal., FEMS Microbiology Letters, 
129:57-62 (1995) disclose a Rhodococcus sp. having 
activity against a broad range of nitriles and 
dinitriles and able to catalyze regio-specif ic and 
stereo-specific nitrile biotransformations. 

Genes encoding amidase activity have been cloned, 
sequenced, and expressed in recombinant organisms. 
For example, Azza et al., (FEMS Microbiol. Lett. 122, 
129, (1994)) disclose the cloning and over-expression 
in E . coll of an amidase gene from Brevibacterium sp. 
15 R312 under the control of the native promoter. 

Similarly, Kobayashi et al., (Eur. J. Biochem. , 211, 
327, (1993)) teach the cloning of both a nitrile 
hydratase and amidase gene from R. rhodococcus Jl and 
their co-expression in E. coli. 

What is needed and inventive over the prior art 
is a method for the stereospecif ic conversion of 
racemic alkyl nitriles to the corresponding R- or 
S-alkanoic acids using a recombinant organism. 

SUMMARY OF THF. THVf.mt j nfj 

This invention relates to nucleic acid fragments 
encoding: ,/ 

1) the a subunit of a stereospecif ic nitrile 
hydratase enzyme, said gene having at least a 64% base 
homology with the a subunit coding region of the 
Rhodococcus rhodochrous Jl L-NHase gene [Kobayashi, 
M., Biochem. Biophys . Acta, 1129:23-33 (1991)] and 
said enzyme capable of catalyzing the hydrolysis of 
racemic aryl-2-alkane nitriles to the corresponding R- 
or S- amides; and 

2) the p subunit of a stereospecif ic nitrile 
hydratase enzyme, said gene having at least a 52% base 
homology with the (3 subunit coding region of the 
Rhodococcus rhodochrous Jl L-NHase gene and said 
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enzyme capable of catalyzing the hydrolysis of racemic 
aryl-2-alkane nitriles to the corresponding R- or S- 
amides . 

Another embodiment of the invention is a nucleic 
5 acid fragment comprising the nucleic acid fragments 

encoding both the a and p subunits of a stereospecif ic 
nitrile hydratase enzyme described above, said enzyme 
capable of catalyzing the hydrolysis of racemic 
aryl-2-alkane nitriles to the corresponding R- or S- 
10 amides. 

A further embodiment of the invention is a 
nucleic acid fragment encoding the a subunit of a 
stereospecif ic nitrile hydratase enzyme, said nucleic 
acid fragment having the nucleotide sequence as 

15 represented in SEQ ID NO.:3 and said enzyme capable of 
catalyzing the hydrolysis of racemic alkyl nitriles to 
the corresponding R- or S- amides. 

A further embodiment of the invention is a 
nucleic acid fragment encoding the (3 subunit of a 

20 stereospecif ic nitrile hydratase enzyme, said nucleic 
acid fragment having the nucleotide sequence as 
represented in SEQ ID NO.: 4 and said enzyme capable of 
catalyzing the hydrolysis of racemic alkyl nitriles to 
the corresponding R- or S- amides. 

25 Still another embodiment of the invention is a nucleic 

acid fragment encoding both the a and p subunits of a 
stereospecif ic nitrile hydratase enzyme, said nucleic acid 
fragment having the nucleotide sequence as represented in 
SEQ ID NO.: 17 and said enzyme capable of catalyzing the 

30 hydrolysis of racemic aryl-2-alkane nitriles to the 
corresponding R- or S- amides. 

Further embodiments of the invention include 
1) the polypeptide a subunit of a 
stereospecif ic nitrile hydratase enzyme, said a 

35 subunit having the amino acid sequence as represented 
in SEQ ID NO . : 1 and said enzyme being capable of 
catalyzing the hydrolysis of racemic ary 1-2-alkane 
nitriles to the corresponding R- or S- amides; and 

5 
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10 



2) the polypeptide (3 subunit of a 
stereospecific nitrile hydratase enzyme, said 0 subunit 
having the amino acid sequence as represented in SEQ 
ID NO.:2 and said enzyme being capable of catalyzing 
the hydrolysis of racemic aryl-2-alkane nitriles to 
the corresponding R- or S- amides. 

A further embodiment of the invention is a 
stereospecific nitrile hydratase enzyme, said enzyme 
comprising the combined a and 0 subunits having the 
respective amino acid sequences SEQ ID NOs . : 1 and 2 in 
proper conformation such that said enzyme catalyzes 
the hydrolysis of racemic aryl-2-alkane nitriles to 
the corresponding R- or S- amides. 

A still further embodiment of the invention is a 
15 6.5 kb nucleic acid fragment encoding a nitrile 
hydratase enzyme and the accessory nucleic acid 
fragments necessary for the enzymes 1 s active 
expression and further characterized by the 
restriction fragment map shown in Figure 2. This 
6.5 kb nucleic acid fragment is incorporated into an. 
expression vector capable of transforming a suitable 
host cell for the expression of active stereospecific 
nitrile hydratase as characterized by the plasmid map 
shown in Figure 3. 

25 The invention further provides a region of the 

P. putidia^ genome encompassed within the 6.5 kb 
fragment, designated P14K, which encodes a polypeptide 
that is necessary for the bioactivity of the 
stereospecific nitrile hydratase enzyme isolated from 

30 Pseudomonas putida NRRL-18668. 

Additionally the invention provides a nucleic 
acid fragment encoding a 18668 amidase having an amino 
acid sequence as represented in SEQ ID NO.:28, wherein 
the amino acid sequence may encompass amino acid 

35 substitutions, deletions or additions that do not 

alter the function of said amidase. The 18668 amidase 
is isolated from Pseudomonas putida NRRL-18668 and is 
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distinct from the amidase isolated from Pseudomonas 
chlororaphis B-23 (FERM B-187) . 

The present invention further provides 
recombinant hosts, transformed with the nucleic acid 
5 fragment encoding a 18668 amidase and/or the qenes 
encoding the a, P nitrile hydratase subunits and the 
P14K region of the Pseudomonas putida NRRL-18668 
genome . 

The invention also provides methods for the 
10 conversion of racemic nitriles to the corresponding R- 
or S-amides or corresponding enantiomeric R- or S- 
carboMylic acids using the above transformed hosts 
containing nucleic acid fragments encoding a 18668 
amidase and/or the genes encoding the a, (3 nitrile 
15 hydratase subunits and the P14K region of the 
Pseudomonas putida NRRL-18668 genome. 

Other embodiments of the invention are: 

1) a transformed microbial host cell 
comprising the nucleic acid fragment represented by 

20 SEQ ID NO.: 17 wherein said host cell expresses active 
nitrile hydratase enzyme capable of catalyzing the 
hydrolysis of racemic aryl-2 alkane nitriles to the 
corresponding R- or S- amides; and 

2) a transformed microbial host cell 
25 comprising the 6.5 kb nucleic acid fragment 

characterized by the restriction map shown in Figure 2 
wherein said host cell expresses active nitrile 
hydratase enzyme capable of catalyzing the hydrolysis 
of racemic aryl-2 alkane nitriles to the corresponding 
30 R- or S- amides. 

Other embodiments of the invention are host cells 
transformed with nucleic acid fragments represented by 
SEQ ID NO,: 17 or the restriction maps of Figures 2 and 
3, wherein the host cell is selected from the group 
35 consisting of bacteria of the genera Escherichia, 

Pseudomonas, Rhodococcus, Acinetobacter, Bacillus , and 
Streptomyces, yeast of the genera Pichia, Hansenula, 
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and Saccharomyces , and filamentous fungi of the genera 
Aspergillus, Neurospora, and Penicillium . 

A particular embodiment of the invention is 
Escherichia coll transformed with the nucleic acid 
5 fragment represented by SEQ ID NO.: 17 or the nucleic 
acid fragment represented by the restriction map of 
Figure 2 . 

A further embodiment of the invention is an 
expression vector described in Figure 6 comprising 

10 .1) a 5.0 kb nucleic acid fragment from the 6.. 5 kb 

fragment of Claim 10, and 2) a nucleic acid fragment 
having the nucleic acid sequence as given in SEQ ID 
NO.:20, wherein said nucleic acid fragment encodes an 
amidase enzyme, and wherein said expression vector is 

15 capable of transforming suitable host cells for the 
co-expression of active s t ereospecif ic nitrile 
hydratase and amidase. A further embodiment is a host 
cell transformed with this expression vector wherein 
more particularly the host is selected from the group 

20 consisting of the genera Escherichia, Pseudomonas, 
Rhodococcus, Acinetobacter , Bacillus, Streptomyces, 
Hansenula, Saccharomyces, Pichia, Aspergillus, 
Neurospora, and Penicillium. A further embodiment is 
Escherichia coli SW17 transformed with pSW17. 

25 A further embodiment of the invention is a method 

for converting a nitrile of the formula 




I 

(R.S) 



wherein : 
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A is selected from the group consisting of: 



H 

CI N 




F 

A-10 A-11 



R 1 is C x -C 4 alkyl; 
R 2 is H or OH; 

R 3 is H, CI, OCF 2 H r <CH 3 ) 2 CHCH 2 , H 2 C=C ( CH 3 ) CH 2 NH , 

° O 

®CJ N - ■ & m2 ~ <§>-°- . 
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; and 



R 4 is CI or F; 
to the corresponding amide comprising contacting said 
nitrile with the transformed host cell containing a 
nucleic acid fragment having the nucleotide sequence 
5 represented by SEQ ID NO.: 17 that stereospecif ically 
converts the racemic nitrile to the corresponding 
enantiomeric R- or S-amide, the host cell selected 
from the group consisting of Escherichia, Pseudomonas, 
Rhodococcus, Acinetobacter , Bacillus, Streptomyces , 

10 Hansenula, Saccharomyces , Plchla, Aspergillus, 
Neurospora, and Penicillium . 

The Applicants also provide a method for the 
conversion of the above described nitrile to 
corresponding enantiomeric (R) or ( S) -carboxylic acid 

15 by contacting the nitrile with the transformed host 
comprising an expression vector comprising a nucleic 
acid fragment represented by Figure 2 and the nucleic 
acid sequence of SEQ ID NO.:20, the host cell selected 
from the group consisting of Escherichia, Pseudomonas, 

20 Rhodococcus, Acinetobacter, Bacillus, Streptomyces, 
Hansenula, , Saccharomyces, Plchla, Aspergillus, 
Neurospora, and Penicillium . 

A further embodiment of the invention is a 
nucleic acid fragment encoding the a and P subunits of 

25 a stereospecif ic nitrile hydratase enzyme, said 

portion of the nucleic acid fragment encoding the a 
subunit having at least a 64% base homology to the 
Rhodochrous Jl L-NHase gene and said portion of the 
nucleic acid fragment encoding the (3 subunit having a 

30 52% base homology to the Rhodochrous Jl L-NHase gene, 
and said enzyme capable of catalyzing the hydrolysis 
of racemic ary 1-2-alkane nitriles to the corresponding 
R- or S- amides. 

10 
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Yet another embodiment of the invention is the 
polypeptide encoded by any one of the nucleic acid 
fragments of the invention. 

Embodiments of the invention are plasmids pSW2 
5 carried in SW2 and designated as ATCC 69888, pSW17 
carried in SW17 cind designated as ATCC 69887, pSW50 
carried in P. pastorls SW50.2 and designated as 
ATCC 74391, pSW37 carried in E. coll SW37 and 
designated as ATCC 98174, and pSW23 carried in E. coll 
10 5W23 and designated as ATCC 98175. 

BRIEF DESCRIPTION OF THE FIGURES 

BIOLOGICAL DEPOSITS AND SEQUENCE LISTING 
Figure 1 is a plasmid map of the plasmid pSWl 
containing a 6.5 kb DNA fragment which encodes the a 
15 and (3 subunits of the nitrile hydratase enzyme 
isolated from P. pvtlda (NRRL-18668) . 

Figure 2 is a restriction map of the 6.5 kb 
nucleic acid fragment which includes the nitrile 
hydratase gene isolated from P. putlda (NRRL-18668) 
20 showing the location of the a and P subunits. 

Figure 3 is a plasmid map of the plasmid pSW2 
created by inserting the 6.5 kb DNA fragment 
comprising the genes encoding the a and (J subunits of 
nitrile hydratase into the wide-host-range vector 
25 pMMB2 07 . 

Figure 4 is a plasmid map of the plasmid pSW5 
created by inserting a 2.8 kb subclone of the 6.5 kb 
nucleic acid fragment comprising the genes encoding 
the a and (3 subunits of nitrile hydratase into the wide- 
30 host-range vector pMMB207 . 

Figure 5 is a western blot analysis showing the 
production of NRRL-18668 nitrile hydratase protein in 
E- coll. (A) Coomassie Blue stained SDS-PAGE gel of 
protein extracts from uninduced (u) and induced (i) 
35 E. coll transformed with the plasmid pSW2 . 

(B) Western blot analysis of duplicate gel shown in 
<A) using anti-NH sera. M, protein molecular weight 
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markers; NH, nitrile hydratase protein from NRRL- 
18668. Arrow indicates NH . 

Figure 6 is a plasmid map of the plasmid pSW17 
created by inserting a 1.5 kb DNA fragment comprising 
5 the gene encoding amidase from Pseudomonas 

chlororaphis B23, and a 5.0 kb subclone of the 6.5 kb 
DNA fragment comprising the genes encoding the a and {} 
subunits of nitrile hydratase into the wide-host-range 
vector pMMB2 07 . 

10 Figure 7 illustrates the nucleotide and amino 

acid sequences of the Pseudomonas putida (NRRL-18668) 
a and p nitrile hydratase coding regions also found in 
SEQ ID NO. : 17 . 

Figure 8 is a restriction map of the 6.5 kb 
15 nucleic acid fragment which includes the nitrile 

hydratase gene isolated from P. putida (NRRL-18668) 
plus sequence upstream of the EcoRl site (shown in 
Figure 2) including a new Pstl site. 

Figure 9 is a restriction map of the 6.5 kb 
20 nucleic acid fragment which includes the nitrile 

hydratase gene isolated from P. putida (NRRL-18668) 
plus sequence upstream of the new Pstl site (shown in 
Figure 8) including a new EcoRl site. 

Figure 10 is a restriction map of an 8 kb nucleic 
25 acid fragment showing the 6.5 kb nucleic acid fragment 
which includes the nitrile hydratase gene isolated 
from P. putida (NRRL-18668), P14K, and the region 
encoding a P. putida (NRRL-18668) amidase enzyme. 

Figure 11 is a plasmid map of pHIL-D4B2 created 
30 by replacing the 0 . 9 kb EcoRl/Xbal fragment in pHIL-D4 
with the 0.9 kb EcoRl/Xbal fragment from pA0815. 

Figure 12 is a plasmid map of pSW46 created by 
the insertion of the a gene of the nitrile hydratase 
enzyme into the EcoRl site of pHIL-D4B2. 
35 Figure 13 is a plasmid map of pSW47 created by 

the insertion of the (} gene of the nitrile hydratase 
enzyme into the EcoRl site of pHIL-D4B2 . 
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Figure 14 is a plasmid map of pSW48 created by 
the insertion of the P14K gene into the EcoRl site of 
pHIL-D4B2 . 

Figure 15 is a plasmid map of pSW49 containing 
5 the a and (3 expression cassettes from pSW4 6 and pSW47. 

Figure 16 is a plasmid map of pSWSO containing 
the <x ? P and P14K expression cassettes from pSW46, 
pSW4 7 and pSW4 8. 

Figure 17 is a plasmid map of pSW37 containing 
10 the expression cassette for the amidase isolated from 
P. pvtlda (NRRL-18668) . 

Figure 18 is a plasmid map of pSW23 containing 
the expression cassette for the amidase, a, P and P14K 
isolated from P „ putlda (NRRL-18668) . 
15 Applicants have provided sequence listings 1-28 

in conformity with 37 C.F.R. 1.821-1.825 and 
Appendices A and B ("Requirements for Application 
Disclosures Containing Nucleotides and/or Amino Acid 
Sequences") and in conformity with "Rules for the 
20 Standard Representation of Nucleotide and Amino Acid 

Sequences in Patent Applications" and Annexes I and II 
to the Decision of the President of the EPO, published 
in Supplement No. 2 to OJ EPO, 12/1992. 

Applicants have made the following biological 
25 deposits under the terms of the Budapest Treaty on the 
International Recognition of the Deposit of 
Micro-organisms for the Purposes of Patent Procedure: 

Depositor Identification Reference Intl. Depository Designation Date of Deposit 

Pseudomonas Putida NRRL 18668 6 July 1990 

Escherichia coli SW2 carrying pSW2 ATCC 69888 15 August 1995 

Escherichia coli SW17 carrying pSWI7 ATCC 69887 15 August 1995 

Pichia pastoris SW502 carrying pSW50 ATCC 74391 20 September 1996 

E. coli SW37 carrying pSW37 ATCC 9817* 20 September 1996 

E. coli SW23 carrying pSW23 ATCC 98175 20 Septem ber 1 996 

As used herein, "NRRL" refers to the Northern 
Regional Research Laboratory, Agricultural Research 
30 Service Culture Collection International Depository 
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Authority located at 11815 N. University Street, 
Peoria, IL 61604 U.S.A. The "NRRL No." is the 
accession number to cultures on deposit at the NRRL. 
As used herein, "ATCC" refers to the American 
5 Type Culture Collection International Depository 

Authority located at 12301 Parklawn Drive, Rockville, 
MD 20852 U.S.A. The "ATCC No." is the accession 
number to cultures on deposit with the ATCC. 

DETAILED DESCRIPTION OF th f. rwr.rj TT n N 
The present invention provides genes derived from 
PsBudomonas putida (NRRL-18668) which encode two 
polypeptides, which, in combination, have the ability 
to act as a catalyst to selectively hydrate one 
nitrile enantiomer in a racemic mixture to produce the 
15 chiral amide. This invention also provides a 

recombinant nucleic acid fragment containing the genes 
and a set of transformed microbial cell hosts 
containing the recombinant nucleic acid fragment. .The 
invention further provides a method for the production 
20 of the polypeptide catalysts using the transformed 

microbes and the use of the catalyst in chiral amide 
production. Additionally, the invention provides for 
the co-expression in a transformed host of the nitrile 
hydratase genes with the genes encoding a 
25 stereospecif ic amidase derived from Pseudomonas 

chlororaphis B-23 (FERM B-187) for the production of 
chiral acids. 

The following definitions are used herein and 
should be referred to for interpretation of the claims 
30 and the specification. 
Abbrgvi at inn^ 

CPIA - 2- (4-chlorophenyl) -3-methy lbutyric acid 

CPIAm - 2- (4-chlorophenyl) -3-methy lbutyramide 

CPIN - 2- (4-chlorophenyl) -3-methylbutyronitrile 

3 5 GC - Gas Chromatography 

HPLC - High-Perf ormance Liquid Chromatography 

IPTG - isopropyl-b-D-thiogalatopyranoside 
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SDS Page - Sodium dodecyl sulfate polyacrylimide 

gel electrophoresis 
The term "nitrile hydratase" refers to an enzyme 
isolated from the bacteria Pseudomonas putida 
5 (NRRL-18668) which is characterized by its ability to 
convert a racemic alkyl nitrile to the corresponding 
enantiomeric R- or S-amide through an intermediate 
amide where the starting nitrile is: 

I 

A— C CN 

R2 
I 

(R.S) 

and wherein: 

10 A is selected from the group consisting of: 

H 
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A-8 




A-10 



R 1 is C x -Ci alkyl; 

R 2 is H or OH; 

R 3 is H, CI , OCF 2 H, 





A-9 



and 



A-1 1 



<CH 3 ) 2 CHCH 2 , K 2 C=C(CH 3 )CH 2 NH, 




10 



15 




O 
II 



• o-" 



or 



o 

II 

c- 



; and 



R 4 is CI or F. 
More specifically, the enzyme has an ability to 
connect the racemic alkyl nitrile to the corresponding 
enantiomeric R- or S-alkanoic acid through an 
intermediate amide. 

The instant nitrile hydratase is further defined 
by the amino acid sequences of its a and p subunits as 
respectively given in SEQ ID NO.:l and SEQ ID NO.:2 
which are encoded by the a and (5 nitrile hydratase 
subunit genes whose base sequences are respectively 
given by SEQ ID NO . : 3 and SEQ ID NO . : 4 . 

The term "amidase" refers to an enzyme naturally 
found in the bacterium Pseudomonas putlda B2 3 (FERM 
B-187) which is characterized by its ability to 
convert amides of the structure: 
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R 1 
I 

A— CH— C— NH ? 
II 

O 

1 

wherein : 

A is selected from the group consisting of: 




A-l A-2 



R 1 is C x -C 4 alkyl; 

R 2 is H; F; CI; Br; OH; C x -C 3 alkyl; OCF 2 H; or 
5 H 2 C=C (CH 3 ) CH 2 NH; and 

R 3 is H; F; CI; Br; OH; C x -C 3 alkyl; or C x -C 3 
alkoxy ; 

to the corresponding enantiomeric (R) or 
(S) -carboxylic acid. The amidase of the instant 

10 invention is further identified by the amino acid 
sequence given in Nishiyama et al . , Bacterial . , 
173:2465-2472 (1991) and the DNA base sequence 
disclosed in SEQ ID NO.: 20. 

The term "18668 amidase" refers to an enzyme 

15 naturally found in the bacterium Pseudomonas putida 
NRRL-18668 which is characterized by its ability to 
convert C3 to C6 amides to the corresponding acids. 
In addition, as described in PCT/DK91/00189, the 18668 
amidase is characterized by the ability to convert 

20 some (R, S ) -aryl-2-alkane nitriles to the corresponding 
enantiomerically enriched (R) or (S) -carboxylic acid. 
The amidase of the instant invention is further 
identified by the amino acid sequence given in SEQ ID 
N0.:28 and the DNA base sequence disclosed in SEQ ID 

25 NO.:27. The "18668 amidase" is disctinct form the 
amidase isolated from bacterium Pseudomonas putida 
B23 (FERM B-187) . 
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The term "P14K gene" refers to a region of the 
Psevdomonas putlda NRRL-18668 genome encoding a 
polypeptide as given by SEQ ID NO.:22 having the base 
sequence as given by SEQ ID NO . : 2 1 , where the 
5 expression of the P14K gene is essential for the 
- bioactivity of the Pseudomonas putlca NRRL-18668 
nitrile hydratase enzyme. The term "P14K polypeptide" 
(or "P14K protein") refers to the active polypeptide 
encoded by the P14K region. 
10 "Transformation" refers to the acquisition of new 

genes in a cell by the incorporation of nucleic acid. 

The term "nucleic acid" refers to complex 
compounds of high molecular weight occurring in living 
cells, the fundamental units of which are nucleotides 
15 linked together with phosphate bridges. Nucleic acids 
are subdivided into two types: ribonucleic acid (RNA) 
and deoxyribonucleic acid (DNA) . 

The terms "host cell" and "host organism" refer 
to a microorganism capable of incorporating foreign or 
20 heterologous genes and expressing those genes to 
produce an active gene product. 

The terms "foreign gene", "foreign DNA", 
"heterologous gene", and "heterologous DNA" refer to 
genetic material native to one organism that has been 
25 placed within a host organism. 

The terms "recombinant organism", "transformed 
host", and "transformed microbial host" refer to an 
organism having been transformed with heterologous or 
foreign genes. The recombinant organisms of the 
30 present invention express foreign genes encoding 
active nitrile hydratase and amidase enzymes. 

The term "nucleic acid fragment" refers to a 
fragment of DNA that may encode a gene and/or 
regulatory sequences preceding (5" non-coding) and 
35 following (3" non-coding) the coding region (gene) . 

The term "expression" refers to the transcription 
and translation to gene product from a gene coding for 
the sequence of the gene product, usually a protein. 
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The terms "plasmid" and "vector" refer to an 
extra chromosomal element often carrying genes which 
are not part of the central metabolism of the cell, 
and usually in the form of circular double-stranded 
5 DNA molecules. Such elements may be autonomously 

replicating sequences, genome integrating sequences, 
phage sequences, linear or circular, of a single- or 
double-stranded DNA or RNA, derived from any source. 
The term "cassette" refers to a number of 

10 nucleotide sequences which have been joined or 

recombined into a unique construction. An "expression 
cassette" is specifically comprised of a promoter 
fragment, a DNA sequence for a selected gene product, 
and a transcriptional termination sequence. 

15 The terms "restriction endonuclease" and 

"restriction enzyme" refer to an enzyme which 
catalyzes hydrolytic cleavage within a specific 
nucleotide sequence in double-stranded DNA. 

The term "promoter" refers to a sequence of DNA, 

20 usually upstream of (5' to) the protein coding 

sequence of a structural gene, which controls the 
expression of the coding region by providing the 
recognition for RNA polymerase and/or other factors 
required for transcription to start at the correct 

25 site. 

A "fragment" constitutes a fraction of the 
complete nucleic acid sequence of a particular region. 
A fragment may constitute an entire oene. 

The terms "peptide", "polypeptide" and "protein" 
30 are used interchangeably to refer to the gene product 
expressed . 

The terms "encoding" and "coding" refer to the 
process by which a gene, through the mechanisms of 
transcription and translation, produces an amino acid 
35 sequence. The process of encoding a specific amino 

acid sequence includes DNA sequences that may involve 
base changes that do not cause a change in the encoded 
amino acid, or which involve base changes which may 
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alter one or more amino acids, but do not affect the 
functional properties of the protein encoded by the 
DNA sequence. It is therefore understood that the 
invention encompasses more than the specific exemplary 
5 sequences. Modifications to the sequence, such as 
deletions, insertions, or substitutions in the 
sequence which produce silent changes that do not 
substantially affect the functional properties of the 
resulting protein molecule are also contemplated. For 
example, alteration in the gene sequence which reflect 
the degeneracy of the genetic code, or which result in 
the production of a chemically equivalent amino acid 
at a given site, are contemplated. Thus, a codon for 
the amino acid alanine, a hydrophobic amino acid, may 
15 be substituted by a codon encoding another less 
hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or 
isoleucine. Similarly, changes which result in 
substitution of one negatively charged residue for 
20 another, such as aspartic acid for glutamic acid, or 
one positively charged residue for another, such as 
lysine for arginine, can also be expected to produce a 
biologically equivalent product. Nucleotide changes 
which result in alteration of the N-terminal and 
25 C-terminal portions of the protein molecule would also 
not be expected to alter the activity of the protein. 
In some cases, it may, in fact, be desirable to make 
mutants of the sequence in order to study the effect 
of alteration on the biological activity of the 
30 protein. Each of the proposed modifications is well 
within the routine skill in the art, as is 
determination of retention of biological activity in 
the encoded products. Moreover, the skilled artisan 
recognizes that sequences encompassed by this 
35 invention are also defined by their ability to 

hybridize, under stringent conditions (0.1X SSC, 0.1% 
SDS, 65°C) , with the sequences exemplified herein. 
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"Homology" refers to the degree to which two 
nucleic acid fragments contain the same base sequence. 
"Homology" is determined by the operation of an 
algorithim and is expressed as a percentage of the 
5 base sequence that is the same in both fragments. 

Applicants have accomplished the following which 
are discussed in more detail below and in the 
Examples : 

I. identified and cloned genes for (i) a 
10 stereospecif ic NHase from NRRL-18668. comprising both 
the a-subunit of the amino acid sequence identified in 
the Sequence Listing by SEQ ID NO.ri and the p-subunit 
of the amino acid sequence identified in the Sequence 
Listing by SEQ ID NO . : 2 ; (ii) an amidase from 
15 NRRL-18668 with deduced amino acid sequence identified 
in the Sequence Listing by SEQ ID NO.:28; (iii) a gene 
from NRRL-18668 designated P14K which is essential for 
NRRL-18668 NHase activity and with deduced amino acid 
sequence identified in the Sequence Listing by SEQ ID 
2 0 NO.: 22; 

II. obtained DNA sequences encoding the 
a-subunit identified in the Sequence Listing by SEQ ID 
NO.:3; and the fi-subunit identified in the Sequence 
Listing by SEQ ID NO . : 4 ; and the amidase enzyme 

25 identified in the Sequence Listing by SEQ ID NO.:27; 
and the P14K polypeptide identified in the Sequence 
Listing by SEQ ID NO.:21; 

III. constructed recombinant DNA plasmids 
containing the genes as described in I above located 

30 within an 8.0 kb DNA fragment as described in 
Figure 10. 

IV. transformed microbial hosts with the 
plasmids described in III above as described in 
Figures 3, 15, and 16; 
35 V. developed a method for the production of 

stereospecif ic NHase which comprises growing a 
transformed host described in IV and recovering the 
nitrile hydrating activity from the culture; 
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VI. developed a method for the production of 
chiral amides which comprises stereospecif ically 
hydrating the nitrile using the nitriie hydrating 
activity recovered in V; 

5 VII. developed a method for the production of 

chiral amides which comprises stereospecif ically 
hydrating the nitrile using the nitrile hydrating 
activity recovered in V for the production of chiral 
amides using isolated microbial cells as described in 
10 IV, the treated matter thereof, or a fixed form of 
them; 

VIII. constructed recombinant DNA plasmids 
containing the NHase genes as described in I above, in 
combination with the amidase gene derived from 
15 Pseudomonas chlororaphls B23 (FERM 5-187) or the 
amidase gene described in I above; 

IX. transformed microbial hosts with the 
plasmids described in VIII above as described in 
Figures 6 and 18; 

X. developed a method for the production of 
NHase and amidase which comprises growing a 
transformed host described in IX and recovering the 
nitrile hydrating and amide hydrating activity from 
the culture; and 

25 XI * developed a method for the production of 

chiral amides and chiral acids which comprises 
stereoselective hydration of the nitrile and its amide 
products using the NHase and amidase activities 
. recovered in V for the production of the chiral 

30 products using isolated microbial cells as described 
in IX, the treated matter thereof, or a fixed form of 
them to produce chiral products. 

I- I SO L ATION ANP CT,PNTNG OF THE nttrilf. h ydratasf 

GENE : 

35 A - I SQ l at i OXl and Partial Amino Arid Sequpnriny 

Pf the N i trile Hvrlratasg F.ny.ynu*- 
The instant invention provides a nitrile 
hydratase enzyme which is defined above. The nitrile 
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hydratase of the present invention ---as isolated and 
purified from Pseudomonas putida (NRRL-18668 ) . 
Bacterial nitrile hydratases are known to be generally 
comprised of structurally distinct o: and 0 subunits 
5 (Hashimoto et al., Biosci., Biotechnol. , Biochem. , 
58(10), 1859-65 (1994)). The instant nitrile 
hydratase was separated into a and (3 subunits using 
HPLC methodology. Methods for the purification and 
separation of enzymes by HPLC are common and known in 
10 the art. See, for example, Rudolph et al., 

Chromatogr. Sci . , 51 (HPLC Biol. Macromol.), 333-50 
(1990) . 

N-terminal amino acid sequences of each subunit 
were determined using methods well !:nown in the art. 

15 See, for example, Matsudaira, P., Methods Enzymol., 

182 (Guide Protein Purif.), 602-13 (1990). Fragments 
of each subunit were generated and partial amino acid 
sequences of the fragments were determined. Partial 
sequences of the a and (J subunits of this nitrile 

20 hydratase are shown in SEQ ID NOs.:5-9 and 10-13 f 
respectively . 

B. DNA Probe for Tsnlatin n of rhP Nitrile 
Hydratase G^n^r 

In order to isolate the nitrile hydratase gene, a 
25 series of degenerate 21-mer oligonucleotide primers 
based on the available NRRL-18668 NHase amino acid 
sequence were designed and synthesized for use as 
polymerase chain reaction (PCR) primers. Genomic DNA 
was isolated from P. putida (NRRL-18668) by standard 
30 methods (Sambrook, J., et al., Molecular Cloning: & 
Laboratory Manual. Second Edition, Cold Spring Harbor 
Laboratory Press (1989)) and was used as a target for 
PCR with numerous degenerate primer combinations. The 
resulting amplified products were subjected to 
35 Southern analysis (Southern, E. M. , J. Mol . Biol., 98, 
503, (1975)) using isolated Rhodococcus rhodochrous Jl 
L-NHase gene (Kobayashi, M . , Biochem. Biophys . Acta 
1129:23-33 (1991)) as a probe. One strongly 
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hybridizing fragment of 0.7 kb was identified from a 
PCR reaction based on the degenerate primers 
designated Dl and D7 . The sequences of Dl and D7 are 
identified in the Sequence Listing as SEQ ID NO.:14 
5 and SEQ ID NO.:15, respectively. The 0.7 kb PCR 
fragment was subcloned into the plasmid M13 using 
standard methods (Sambrook, supra) and sequenced. 
Sequencing revealed that the 0.7 kb fragment 
demonstrated a 60% base homology to the Rhodococcus 
rhodochrous Jl L-NHase gene. Deduced amino acid 
sequence from this 0.7 kb fragment was compared to 
available NRRL-18668 amino acid sequences determined 
previously and to other known NHase sequences. The 
comparison confirmed that this fragment was part of 
15 the P. putlda NHase gene. The 0.7 kb DNA fragment was 
sequenced and is identified as SEQ ZD No. :16. The 
0.7 kb fragment was used as a probe to isolate a 
genomic DNA fragment from NRRL-18668 which contains, 
the entire NHase gene . 
20 C. Isolation of a Genomir DNA r^ym^f 

Containina NRRL-1866R nh* s ~ GfitUfi ; 
Genomic DNA isolated from P. putlda (NRRL-18668) 
was digested with restriction enzymes EcoRl and Xhol 
and size-selected by agarose gel electrophoresis based 
25 on Southern blotting using the 0.7 kb DNA fragment 
described above as a probe . Restricted genomic DNA 
was then cloned into phage lambda ZAPII [Stratagene, 
La Jolla, CA] . The lambda library was screened with 
the 0.7 kb DNA fragment probe and one positively 
30 hybridizing phage clone with a DNA insert of 6.5 kb 
was identified and isolated. 

D - P l asm i d Construction and Host Tran.sforni^inn 

and Confirmation of nh * S q Spqnpnnp r 
Once a positive clone containing a 6.5 kb insert 
35 was identified, the presence of the NHase gene in the 
clone was confirmed by a process of (i) constructing a 
plasmid containing the 6.5 kb insert (pSWl, Figure 1); 
(ii) transforming a suitable host cell with this 
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plasmid; (iii) growing up the transformed host and 
purifying the plasmid DNA; (iv) constructing a 
restriction map from the purified D!TA (Figure 2); and 
(v) sequencing the NHase genes. The confirmation 
5 process is common and well known in the art and 
techniques used may be found in Sambrook supra. 

Sequence analysis confirmed the nitrile hydratase 
coding regions, which consisted of two open reading 
frames corresponding to the alpha and beta subunits of 

10 the corresponding NHase protein as defined in the 

Sequence Listing by SEQ ID NO.:17 and Fig. 7. The a 
and 3 open reading frames were analyzed for base 
sequence similarly to the Rhodococcus rhodochrous Jl 
L- NHASE gene used as a probe and described above. 

15 Homology comparisons showed that the a open reading 
frame had 64% homology to the region encoding the a 
subunit on the Jl gene and the p open reading frame had 
52% homology to the region encoding the P subunit on 
the Jl gene. 

20 II. CONSTRUCTION Of E XPRESSION VECTOR AND EXPRESSION 

STRAINS : 

The present invention provides a transformed host 
cell capable of expressing active nitrile hydratase 
enzyme. Generally, it is preferred if the host cell 

25 is an E. coli, however, it is not outside the scope of 
the invention to provide alternative ' hosts . Such 
alternative hosts may include, but are not limited to, 
members of the genera Pseudomonas , Rhodococcus, 
Aclnetobacter, Bacillus, Saccharomyces , Pichla, 

30 Aspergillus, Hansenula, and Streptomyces . 

The present invention provides a variety of 
plasmids or vectors suitable for the cloning of the 
nitrile hydratase gene in the desired host. Suitable 
vectors for construction contain a selectable marker 

35 and sequences allowing autonomous replication or 
chromosomal integration. Additionally, suitable 
vectors for expression contain sequences directing 
transcription and translation of the heterologous DNA 

25 
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fragment. These vectors, comprise a region 5* of the 
heterologous DNA fragment which harbors 

transcriptional initiation controls, and optionally a 
region 3 f of the DNA fragment which controls 
5 transcriptional termination. It is most preferred 
when both control regions are derived from genes 
homologous to the host cell, although such control 
regions need not be derived from the genes native to 
the specific species chosen as a production host. 

10 Suitable vectors can be derived, for example, from a 
bacteria (e.g., pET, pBR322, pUC19, pSP64, pUR278 and 
pORFl), a virus (such as bacteriophage T7 or a M-13 
derived phage), a cosmid, a yeast or a plant. 
Protocols for obtaining and using such vectors are 

15 known to those in the art. (Sambrook. supra.) 

Vectors suitable for E. coll will have compatible 
regulatory sequences and origins of replication. They 
will be preferably multicopy and have a selectable 
marker gene, for example, a gene coding for antibiotic 

20 resistance. 

Promoters useful for driving the expression of 
heterologous DNA fragments in E. coll are numerous and 
familiar to those skilled in the art. Virtually any 
promoter capable of driving the gene encoding the 

25 nitrile hydratase enzyme is suitable for the present 
invention,; although promoters native to E. coll are 
preferred and the inducible IPTG Ptac promoter is most 
preferred (deBoer, H . , Proc. Natl . Acad. Scl . USA, 
80:21-25 (1983). Although an inducible promoter is 

30 preferred, one of skill in the art will appreciate 
that either inducible or constitutive promoters are 
suitable . 

Within the context of the present invention the 
entire 6.5 kb DNA insert containing the NRRL-18668 
35 NHase gene in the plasmid pSWl was subcloned into the 
wide-host-range vector pMMB207 (Bagdasarian, M . , Gene, 
97:39-47 (1991)) under the control of the Ptac 

r 

promoter to create an expression vector designated 
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pSW2 (Figure 3). Additionally, the 2.8 kb Pstl DNA 
fragment derived from the 6.5 kb DNA fragment and 
containing the NRRL-18668 NHase gene but with 
substantially less upstream and downstream flanking 
5 sequence, was also subcloned into the vector pMMB207 
under the control of the Ptac promoter to generate the 
plasmid pSW5 (Figure 4) . Comparing these two 
expression constructs allowed Applicants to 
investigate proximal accessory sequences or proteins 

10 which might be involved in expression or activity of 
NHase, Applicants' studies indicated that the NHase 
genes may be part of an operon which generates a 10 kb 
mRNA transcript, of which only approximately 1.5 kb is 
accounted for by NHase. This suggests that additional 

15 genes are encoded by the upstream and downstream 
sequence flanking NHase. Others have described a 
requirement for downstream sequence for efficient 
expression of NHase in Rhodococcus sp. N-774 
(Hashimoto, Y., Biosci. Biotech. Biochem. , 

20 58:1859-1865 (1994)). 

Following cloning, E. coli XLl-Blue host was 
transformed in parallel with the plasmid pSW2 or pSW5 
described above. Methods of transforming host cells 
with foreign DNA are common and well known in the art. 

25 For example, transforming host cells with foreign DNA 
may be accomplished using calcium-permeabilized cells, 
electroporation, or by transfection using a 
recombinant phage virus. (Sambrook supra). Plasmid 
DNA was isolated from these transf ormants and enzyme 

30 restriction analysis confirmed the construction of two 
separate strains, one harboring the pSW2 plasmid and 
the other harboring the pSW5 plasmid. 

The gene encoding the a subunit, and the gene 
encoding the p subunit of NRRL-18668 NHase were also 

35 expressed in an alternative host, the methylotrophic 
yeast Pichia pastor is . Methods for producing 
heterologous proteins in P. pastoris are well known in 
the art. For each subunit, the coding sequence was 

27 



BNSDOCID: <WO 9712964A2_I_> 



WO 97/12964 



PCT/US96/15969 



10 



placed under control of the methanol inducible 
promoter, alcohol oxidase I (AOX1) . in a vector which 
was subsequently integrated into the host chromosome. 
Each subunit was produced in the respective host after 
induction by methanol. NHase activity was not 
reproducibly obtained upon mixing extract prepared 
from the a producing strain with extract prepared from 
the p producing strain. In addition, a single strain 
producing both a and P subunits under control of the 
AOX1 promoter was constructed. Both subunits were 
produced in this recombinant P. pastoris strain, but 
NHase activity was not obtained. 

Applicants sequenced DNA both upstream and 
downstream of the NHase genes, and identified at least 
15 two open reading frames, one upstream and one 

downstream. The upstream open reading frame was 
determined to encode an amidase enzyme, based on 
comparison of the deduced amino acid sequence to other 
amidase amino acid sequences. Plasmids were 
constructed for the expression of NRRL-18668 amidase 
in E. coli. A search of the protein database with the 
deduced amino acid sequence encoded by the downstream 
open reading frame (designated P14K) indicated no 
significant matches. Plasmids were constructed for 
25 expression of NHase genes only or NHase and P14K genes 
in both E.< coli and P. pastoris. In both E. coli and 
P. pastoris, NHase activity was obtained only when 
P14K was co-expressed with the NHase genes. The 
preference for hydrolysis of S-nitriles (stereo- 
specificity) observed in the native organism was also 
demonstrated in the recombinant orgamisms producing 
•active NHase. 

E XPRESS TON OT THE NT TRTT.E hydf atasf. fn7.vmf. ftN P 

CONVERSION OF SUBSTPRTfrg; . 

35 Transformed E . coli cells harboring plasmid pSW2 

under the control of the IPTG inducible Ptac promoter, 
were grown under standard conditions and induced to 
express the nitrile hydratase enzyme. Cells were 
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harvested and lysed and the protein was detected in 
crude lysates by SDS-polyacrylamide ael 
electrophoresis followed by western blot analysis 
(Egger et al . , Mol . Blotechnol., 1(3), 289-305 (1994)) 
5 using antisera raised against NRRL-I8668 NHase protein 
(Figure 5) . Under these conditions induced cells 
produced approximately 10-fold as much nitrile 
hydratase protein as uninduced cells. Nitrile 
hydratase was not detected from a control strain 
10 harboring the vector pMMB207 without the 6.5 kb 
insert . 

Nitrile hydratase is typically confirmed by 
incubating a suitable substrate nitrile in the 
presence of the crude or purified enzyme. Suitable 

15 substrates for the instant hydratase include a variety 
of racemic alkyl nitriles such as methacrylonitrile, 
methylbutyronitrile and propionit rile . In the instant 
case, nitrile hydratase activity was confirmed by 
monitoring the conversion of methacrylonitrile to the 

20 corresponding amide. Induced cells harboring the 

plasmid pSW2 showed rapid conversion of methacrylo- 
nitrile, while induced cells without the pSW2 plasmid 
showed no conversion of methacrylonitrile.. 
Additionally, induced cells harboring the plasmid pSW5 

25 show no conversion of methacrylonitrile. 

Stereospecif ic activity of the enzyme produced in 
induced cells harboring plasmid pSW2 was confirmed by 
monitoring the conversion of R,S-CFIN to amide 
products using reverse-phase or chiral high pressure 

30 liquid chromatography (HPLC) . Methods of enantiomer 
separation on HPLC are well known in the art. See, 
for example, Mutton, I . , Pract . Approach Chiral Sep., 
Liq. Chromatogr. , 329-55 (1994), Editor (s): 
Subramanian, Ganapathy, Publisher: VCH, Weinheim, 

3 5 Germany. 

IV - CO-EXPRESSION OF NTTRILE HYHRATASr. AMH AMTDASF : 

The present invention further provides a 
transformed microorganism capable of co-expressing 
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both a heterologous nitrile hydrata.se aene and a 
heterologous amidase gene. This transf ormant is 
capable of effecting the conversion of racemic 
mixtures of aryl-2-alkane nitriles to the 
5 corresponding carbo::ylic acids via the amide 
intermediate . 

A number of amidase encoding genes may be 
suitable for co-expression with the instant nitrile 
hydratase. However, the amidase gene isolated from 
10 Pseudomonas chlororaphis B23 and defined above is 
preferred. 

The gene encoding the Pseudomonas chlororaphis 
B23 amidase is known (Nishiyama, M . J., Bacteriol., 
173:2465-2472 (1991)) and was obtained through PCR 

15 amplification using appropriate primers. The 

amplified gene comprising 1.5 kb was subcloned into a 
pMMB207 plasmid (already containing the nitrile 
hydratase gene) using standard restriction enzyme • 
digestion and ligation techniques (Sambrook supra) to 

20 generate the plasmid pSW17 (Figure 6). The plasmid - 
pSW17 was constructed so as to place the amidase gene 
and the nitrile hydratase gene both under the control 
of the same IPTG inducible Ptac promoter. The plasmid 
pSW17 was then used to transform a suitable host cell 

25 (e.g., E . coli XLl-Blue) according to standard 
methods . y 

In order to confirm the activity of the amidase 
produced in cells transformed with plasmid pSW17, 
cells transformed by plasmid pSW17 -ere grown up and 
30 induced with IPTG in the presence of a suitable 

nitrile and the chiral amide and free acid products 
were identified by chiral HPLC analysis. 

> The following Examples are meant to illustrate 
the invention but should not be construed as limiting 
35 it in any way. 
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EXAMPLE 1 

ISOLATION . PURIFICATION. AND AMINO ACID SE QUENCING 
OF PORTIONS OF THE NITRILE HYDRATASE a AND ft SUBUNITS 

PsBudomonas pvtida (NRRL-18668) was cultured in a 
5 medium (10 g/L glucose, 8.7 g/L K 2 HFC 4/ 6.8 g/L KH 2 P0 4 , 
2.0 g/L acetonitrile, 1.85 g/L NaNO^ 0.50 g/L 
MgS0 4 *7H 2 0, 0.050 g/L FeS0 4 *7H 2 0, 0.30 mg/L MnCl 2 *4H 2 0, 
0.10 mg/L H3BO3, 0.050 mg/L NiS0 4 *6H 2 0, 0.050 mg/L 
CuS0 4 »5H 2 O, 0.050 mg/L Co (N0 3 ) 2 • 6H 2 0 , 0.030 mg/L 

10 Na 2 Mo0 4 *2H 2 0, 0.030 mg/L ZnS0 4 *4H 2 0, 0.020 mg/L KI, 

0.020 mg/L KBr, 0.010 mg/L pyridoxine - HC1 , 0.0050 mg/L 
thiamine^HCl, 0.0050 mg/L D-pantothenate f Ca 2 + salt, 
0.0050 mg/L riboflavin, 0.0050 mg/L nicotinic acid, 
0.0050 mg/L p-aminobenzoic acid, 0.0020 mg/L biotin, 

15 0.0020 mg/L vitamin B 12 , 0.0020 mg/L folic acid, 

pH 7.0) at 30°C for 48 h. The bacterial cells were 
harvested. 100 g of the bacterial cells were 
disrupted and the cell free extract fractionated with 
ammonium sulfate. The ammonium sulfate fractionation 

20 precipitate was dissolved in buffer and loaded on a 
Phenyl Sepharose CL-4B chromatography column 
(Pharmacia Biotech, Uppsala, Sweden) , followed by a 
DEAE-cellulose chromatography column, and a second 
DEAE-cellulose chromatography column (Whatman, 

25 Maidstone, England) . Active fractions were pooled and 
concentrated. The concentrate containing the enzyme 
was loaded on a reverse phase high performance 
chromatography column (Vydac 208TP104) and two 
subunits (a and (3) were obtained. The N-terminal 

30 amino acid sequence of the a- and p-subunits was 
determined using an amino acid sequencer (Beckman 
model LF3000G gas phase protein sequencer, Fullerton, 
CA. The a- and fi-subunits were cleaved separately 
using cyanogen bromide, TPCK-treatec trypsin, and AspN 

35 protease, and the peptides generated were separated on 
a reverse phase high performance chromatography column 
(Vydac 208TP104, The Separations Group, Hesperia, CA) . 
Fractions containing well-resolved peptides were 
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10 



20 



sequenced using the same technique. The sequences of 
the individual peptides were combined into partial 
sequences of the subunits by alignment with the 
published sequences of the a- and p-subunits of 
5 nitrile hydratases from P . chlororaphis B23 [Nishiyama 
et al., J. Bacteriol., 173:2465-2472 (1991) ], 
Rhodococcus N-774 [Ikehata et al., Eur. J. Biochem. , 
181:563-570 (1989)], and Rhodococcus rhodochrous Jl 
[Kobayashi et al . , Biochim. Biophys . Acta, 1129:23-33 
(1991)]. The partial sequences of the of the a- and 
0-subunits of nitrile hydratase from Pseudomonas putida 
(NRRL-18668) were identified as defined in the 
Sequences Listing as SEQ ID NOs.:5-9 and SEQ ID 
NOs * : 10-13, respectively. 

15 EXAMPT.F 7 

PREPARATION OF DNA PKOBF! TOP m? F ,L-1R66R WHASB GEME 
The degenerate oligonucleotide designated Dl as 
defined in the Sequence Listing as SEQ ID NO.:14, and 
the degenerate oligonucleotide designated D7 as 
defined in the Sequence Listing as SEQ ID NO.: 15 were 
used as primers in a polymerase chain reaction (PCR) 
[Mullis, K. B., Meth. Enzymol., 155:335-350 (1987)] 
with NRRL-18668 genomic DNA as target. PCR conditions 
were as follows: 100 ng target, 1 uM each primer, 
25 200 pM each of dATP, dCTP, dGTP, dTTP, 10 mM Tris-HCl 
pH 8.3, 50/ mM KC1, 1.5 mM MgCl 2 , 0.001% gelatin, 
25 U/mL Amplltacf" DNA polymerase (Perkin Elmer Cetus, 
Norwalk, CT) . PCR parameters were as follows: 94°C 
1 min, 55°C 1 min, 72°C 1 min, 40 cycles. One half of 
30 the PCR product was subjected to ethidium bromide 

agarose gel electrophoresis followed by transfer to 
nitrocellulose and Southern analysis with 32 P labeled 
Rhodococcus rhodochrous Jl L-NHase gene as probe 
[Southern, E.M., J. Mol. Biol., 98:503 (1975)]. 
35 Strong hybridization of a DNA fragment of 

approximately 0.7 kb suggested the presence of at 
least a portion of a NHase gene in this PCR product. 
The remaining half of the PCR product was restricted 
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with EcoRl (the primers were designed with EcoRl sites 
at the 5 1 ends) and ligated to Eco?.:. restricted M13 
mpl9 vector DNA. Ligation mi:: was used to transfect 
competent E. coll XLl-Blue which was plated onto LB 
5 plates supplemented with IPTG and M-gal (5-bromo- 

4chloro-3indolyl-p-D-galactopyranoside) [Maniatis, T. , 
Molecular Cloning: A Laboratory Manual (1989)]. 
Phage DNA was prepared from several "white" plaques 
[Maniatis, T., Molecular Cloning: A Laboratory Manual 

10 (1989)] and sequenced by dideoxy termination protocol 
using universal primer [Sanger, F., Science, 
214:1205-1210 (1981)]. Analysis of the nucleotide 
sequence obtained as defined in the Sequence Listing 
as SEQ ID NO.: 16 confirmed that the PCR product 

15 corresponds to part of the NHase gene. 

EXAMPLE 3 
ISOLATION OF GENOMIC DNA FFAGMENT 
CONTAINING NRRT.-1 8668 NHASE GENE 

Total genomic DNA (10 pg) from NRRL-18668 was 

20 isolated [Maniatis, T., Molecular Cloning: A 

Laboratory Manual (1989)], restricted with EcoRl and 
Xhol, and one half subjected to agarose gel 
electrophoresis followed by Southern blot using the 
32 P labeled 0.7 kb fragment described in Example 2 as a 

25 probe [Southern, E. M., J. Mol. Bid., 98:503 (1975)]. 
A strongly hybridizing band of approximately 6.5 kb 
was identified, suggesting that the NHase gene (or 
part of it) resides on this 6.5 kb genomic DNA 
fragment. A duplicate agarose gel was run and a gel 

30 slice from the 6.5 kb region was encised. DNA 

extracted from the gel^slice isolated [Maniatis, T., 
Molecular Cloning: A Laboratory Manual (1989)] was 
ligated to lambda DNA restricted with EcoRl and Xhol. 
The ligation mix was packaged into phage particles and 

35 used to transfect E, coll XLl-Blue according to the 
manufacturer's instructions [Stratagene, La Jolla, 
CA] . Several thousand plaques were screened using the 
32 P-labeled 0.7 kb fragment as probe [Maniatis, T., 
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Molecular Cloning: A Laboratory Manual (1989)]. One 
positively hybridizing plaque was subsequently 
purified. 

EXAMPLE 4 

5 CONSTRUCTION OP PT.a^jp 

CONTAINING NRRL-IR ^fl r.r. W 
DNA from the purified phage plaque described in 
Example 3 was excised and converted to a pBluescript- 
based plasmid according the the manufacturer's 
10 instructions [Stratagene, La Jolla, CA] , and 

designated pSWl . The plasmid pSWl has a 6.5 kb insert 
containing the NRRL-18668 NHase gene as described in 
Fig. 1. 

example 5 

15 TRANSFORMATION- O F HO?T T3V 

PLASMID CONTAINING WPRT.- 18668 NHASE ^ENF. 
The plasmid pSWl described in Example 4 was used 
to transform competent E. coli XLl-Blue cells by the 
CaCl 2 method [Maniatis, T., Molecular Cloning: A 
20 Laboratory Manual (1989)]. 

EXAMPLE g 

RECOMBINANT PLASMID PT7RTF jr. atton anh 
CONSTRUC TION OF RESTRICTION map fop 
G E NOMIC DNA FRAGMENT CONTAINING wm-lfifi 68 NHASE QEHZ 

25 Plasmid DNA purified by the alkaline lysis method 

[Maniatis,/ T. f Molecular Cloning: A Laboratory Manual 
(1989)] from JET. coli cells harboring plasmid pSWl, 
described in Example 5, was restricted with EcoRl, 
Pstl, Kpnl, Hind3, and Xhol singly or in various 

30 combinations, followed by agarose gel analysis, and 
Southern analysis using the 0.7 kb PCR product 
described in Example 2 as a probe [Southern, E.M., J m 
Mol. Biol., 98:503 (1975)]. A restriction map 
constructed for the 6.5 kb insert fragment of the 

35 plasmid pSWl, including the location of the NHase gene 
is shown in Fig. 2. 
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EXAMP LE 7 

DNA SEQUENCING OF NRRL-1 8 6 6Q N HASE HENE 

Based on the restriction map described in 
Example 6, the nucleotide sequence of a fragment of 
5 DNA encompassing the NHase gene was determined by the 
Sanger dideoxy method [Sanger, F., Science, 
214:1205-1210 (1981)] using double-stranded plasmid 
DNA as template. The nucleotide sequence and the 
corresponding predicted amino acid sequences for the a 
10 and P peptides are defined in the Sequence Listing as 
SEQ ID NO.: 17 and Fig. 7. 

EXAMPLE 8 

CONSTRUCTION OF NRRL-18668 NHASE EYPPF. SSTON VEPTOP 
Plasmid pSWl was restricted with EcoRl and Xhol 

15 and the 6.5 kb fragment was ligated to the wide host 

range plasmid pMMB207 [Bagdasarian, M., Gene, 97:39-47 
(1991)] restricted with EcoRl and Sal 1 to generate 
the plasmid designated pSW2 and shown in Fig. 3. The 
2.8 kb Pstl DNA fragment containing the NRRL-18668 

20 NHase gene was excised from plasmid pSW2 by digestion 
with Pstl restriction enzyme and ligated* into the Pstl 
site of vector pMMB207 to generate the plasmid 
designated pSW5 and shown in Fig. 4. 

EXAMPLE q 

25 CONSTRUCTION OF NRRL-18668 NHASE EXPRESSION STRAIN 

Plasmids pSW2 and pSW5 described in Example 8 
were used to transform competent E. coll XLl-Blue 
cells which were plated onto LB plates supplemented 
with 12.5 pg/mL chloramphenicol [Maniatis, T., 
30 Molecular Cloning: A Laboratory Manual (1989)]. 

EXAMPLE 10 

EXPRESSION OF NRRL-18668 N HASE PROTETN 

E. coll cells harboring plasmid pSW2, described 
in Example 8A, were grown in SOC media (0.5 g/L NaCl, 
35 20 g/L bacto-tryptone, 5 g/L bacto-yeast extract, 
20 mM glucose, 2.5 mM KC1, 10 mM MgCl 2 ) at 37°C to 
OD600=0.5, followed by induction at 30°C by the 
addition of IPTG to 1 mM. After induction times 
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20 



25 



30 



ranging from 0.5 h to 3 h, cells were harvested by 
centrifugation, and suspended in 1/10 volume PBS 
(8.0 g/L NaCl, 0.2 g/L KC1, 1.44 g/L Na 2 HP0 4 , 0.24 g/L 
KH 2 P0 4 pH 7.4) . A cell suspension equivalent to 0.05 
5 OD600 units is added to an equal volume of 2X SDS gel- 
loading buffer (100 mM Tris pH 6.8, 200 mM DTT, 4% 
SDS, 0.2% bromophenol blue, 20% glycerol), boiled for 
5 min, and analyzed by SDS PAGE [Laemmli, U.K., 

Nature, 227:680-685 (1970)] followed by western blot 
10 [Towbin, H., Proc. Natl. Acad. Sci . . 76:4350-4354 

(1979)] using antisera raised against NRRL-18668 NHase 

protein. A positive signal was obtained at 

approximately 2 8 kd and corresponded to purified NHase 

protein as shown in Fig. 5. 

15 EXAMPT.F. 1 1 

EXPRESS TON OF ACTTvr. M^p T .-.asfift ot»«;f 
E. coll cells harboring plasmid pSW2 , described 
in Eample 9, were grown and induced as described in 
Example 9 in a 500 mL batch. Cells were harvested by 
centrifugation and washed with pH 7.2, 0 . 1M phosphate 
buffer (KH 2 P0 4 adjusted with 50% NaOH) containing 15% 
glycerol. Washed cells were stored frozen at -70°C. 
Washed and frozen E. coll cells harboring the pSW2 
plasmid and were suspended in 100 mM phosphate buffer, 
pH 7, at a cell density of O.D. 490 = 0 .62. Methacrylo- 
nitrile was added to a final concentration of 10 mM 
and the mixture was shaken at 250 rpm at room 
temperature. Analysis of supernatant: showed that 
methacrylonitrile was rapidly converted to hydrolysis 
products after 30 min. Cells without the pSW2 plasmid 
showed no activity. 

EXAMPT.F 1 ? 
PRODUCT TON OF r HT RAT. ^.MTDFS 

Induced E . coll cells harboring the pSW2 plasmid 
35 and producing stereospecific nitrile hydratase 

activity as described in Example 11 were suspended in 
100 mM phosphate buffer, pH 7, and a concentration of 
50 mg/mL. One milliliter of this suspension was 
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placed in a glass vial containing 19.3 mg of R,S-CPIN. 
The suspension was shaken at 250 rpm on a rotary 
shaker at room temperature for 68 h. Analysis by 
chiral HPLC reveals only the S-CPIAm was produced from 
5 the R, S-CPIN. 



mg nitrile 



10 



15 



20 



25 



30 



mg amide 



Time, h 



R-CPIN 



S-CPIN 



R~CPIAm 



S-CPIAm 



0 

68 



9.6 
9.6 



9. 6 
5.5 



4.5 



EXAMPT.F 1^ 

CONSTRUCTION OF A VECTOR FOR po-fvpp .esston or mp FT - 
18668 NHASF, 

The amidase gene from Pseudomonas chlororaphls 
B23 (defined as SEQ ID NO.:20) was obtained through 
PCR amplification using primers with overhanging 5' 
EcoRl sites as defined in the Sequence Listing as SEQ 
ID NO.: 18 and SEQ ID NO.: 19. This 1.4 kb DNA fragment 
containing the B23 amidase gene was digested with 
EcoRl restriction enzyme and ligated into the EcoRl 
site of pMMB207, and the 5 . 0 kb EcoRl/Hindlll DNA 
fragment from pSWl, described in Example 4, was 
subcloned between the Xbal and Hindlll to generate the 
plasmid pSW17 as shown in Fig. 6. 

EXAMPLE 14 

CONSTRUCTION OF STRAIN FOR m-RXPttr." t o n or NPPT-is^ft 
NHASE AND PFEUDOMONAS QELQEQEAEJiZS. amtdasr 
Plasmid pSW17 described in Example 13 was used to 
transform competent E. coli XLl-Blue cells which were 
selectively grown on LB plates supplemented with 
12.5 pg/mL chloramphenicol [Maniatis, T. f Molecular 
Cloning: A Laboratory Manual (1989) ] 

EXAMPLE T> 

COMPARISON OF NHase ACTIVITY rpn M pSW? and r .g W g 
E. coll cells harboring the pSW2 or pSW5 plasmid 
and induced according to the protocol in Example 11 
were each suspended separately in 100 mM phosphate 
buffer, pH 7, at a concentration of 20 mg/mL. 
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Butyronitrile was added to each suspension to a final 
concentration of 10 mM. The suspensions were shaken 
at 250 rpm on a rotary shaker at room temperature for 
24 h. At the end of the incubation period, 0.1% 
5 phosphoric acid was added to the suspensions, bringing 
them to a pH of 2-3 and stopping nitrile hydratase 
activity. Cells were removed from the suspension by 
centrifugation . Analysis of the reactions showed the 
following products: 

pSW2 - 94% butyramide, 6% butyronitrile; 
pSW5 - <l% butyramide, 100% butyronitrile. 

EXAMPT.F. If 

PRODUCTION OF S-CPTAM AND z-r*r A FROM p,s-rPT N 
E. coll cells harboring the pSW17 and induced 
15 according to the protocol in Example 11 were suspended 
in 100 mM phosphate buffer, pH 7 , at a concentration 
of 100 mg/mL. One milliliter of this suspension was 
placed in a glass vial containing 19.3 mg of R,S-CPIN 
dispersed in a dry form on 0 . 5 g of 0 . 5 mm glass 
beads. The suspension was shaken in a 20 mL 
scintillation vial at 250 rpm on a rotary shaker at 
room temperature for 68 h. Analysis by chiral HPLC 
reveals both S-CPIAm and the S-CPIA were produced from 
the R, S-CPIN. 



20 



mg nitrile mg amide mg acid 



30 



Time ' h R-CPIN S-CPIN R- CP I Am S-CPIAm R-CPIA S-CPIA 



0 9-6 9.6 0 0 0 0 

68 9 - 6 8-4 0 0.84 0 0.42 

25 EXAMPT.F. 11 

Nuc l eotide ^eouenrina of dna regions 
f l anking nrrt.-i.B66R HH a ^s 

The nucleotide sequences of DNA regions flanking 
the NRRL-18668 NHase were determined by the Sanger 
dideo::y method (Sanger, F. (1981) Science 
214:1205-1210) using double-stranded plasmid DNA as 
template. Using pSWl (Fig. l) as template, the 
nucleotide sequence downstream of NHase, down to the 
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Xhol site (Fig. 2), was determined. This sequence 
contains at least one gene, and potentially several 
more, which is defined as P14K, the nucleotide 
sequence of which is defined in Sequence Listing SEQ 
5 ID NO.:21, and the deduced amino acid sequence is 
defined in Sequence Listing SEQ ID NO.:22. P14K is 
required for NHase activity as described below. 

The nucleotide sequence upstream of NHase, up to 
the EcoRl (Fig. 2), was determined using pSWl (Fig. 1) 

10 as template. The nucleotide sequence further upstream 
of the EcoRl site was determined after subcloning DNA 
fragments corresponding to this region as follows. 
NRRL-18668 genomic DNA was digested with Pstl and then 
self-ligated. Oligo-nucleot ide primers designed to 

15 bind 3' to EcoRl heading upstream (Fig. 2) and 5' to 
Pstl heading downstream (Fig. 2), and defined as 
Sequence Listing SEQ ID NO.:23 and Sequence Listing 
SEQ ID NO.:24 r respectively, were used in a PCR 
reaction to amplify a 0.8 kb fragment corresponding to 

20 DNA upstream of the EcoRl site (Fig. 8). NRRL-18668 
genomic DNA was digested with EcoRl and then self- 
ligated. Oligo-nucleotide primers designed to bind 3 f 
to Pstl heading upstream (Fig. 8) and 5* to EcoRl 
heading downstream (Fig. 8), and defined as Sequence 

25 Listing SEQ ID NO.:25 and SEQ ID NO.:26, respectively, 
were used in a PCR reaction to amplify a 0.7 kb 
fragment corresponding to DNA upstream of the Pstl 
site (Fig. 9) . By subcloning and sequencing the PCR 
fragments, the nucleotide sequence upstream of NHase, 

30 up to the EcoRl site (Fig. 9) was determined. This 
sequence contains at least one gene, and potentially 
more, which has been identified as encoding an amidase 
(based on homology to other amidase sequences), the 
nucleotide sequence of which is defined as Sequence 

35 Listing SEQ ID NO.:27, and the deduced amino acid 

sequence defined as Sequence Listing SEQ ID NO.:28. 

A compiled map of the entire 8.0 kb DNA fragment, 
indicating genes identified, is shown in Fig. 10. 
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Construction Of nlasmids f r ;; QXPrP.^inn 
Of NRRL-18668 NHssp H n Pir/nix pa^nrf.^ 
The 0.9 kb EcoRl/Xbal fragment in pHIL-D4 
5 (Phillips Petroleum, Bartlesville , OK) was replaced by 
the 0.9 kb EcoRl/Xbal fragment from pA0815 
(Invitrogen, San Diego, CA) to generate the plasmid 
pHIL-D4B2 (Fig. 11) which contains ^he following 
elements: 5 ' AOX1 , P. pastorls methanol inducible 

10 alcohol oxidase I (AOX1) promoter; AOX1 term, 

P. pastorls AOX I transcriptional termination region; 
HIS4, P. pastorls histidinol dehydrogenase-encoding 
gene for selection in hls4 hosts; kan, sequence 
derived from transposon Tn903 encoding aminoglycoside 

15 3 '-phosphotransferase, conferring kanamycin, neomycin 
and G418 resistance in a wide variety of hosts, and 
useful as an indicator of cassette copy number; 
3'AOXl, P. pastorls sequence downstream from AOX1 , ■ 
used in conjunction with 5'AOXl for site-directed 

20 vector integration; orl r pBR322 origin of DNA 

replication allowing plasmid manipulations in E. coll; 
and amp, p-lactamase gene from pBR322 conferring 
resistance to ampicillin. An additional feature of 
pHIL-D4B2 is that multiple expression cassettes 

25 (5'AOXl - gene - AOXlterm) can easily be placed into 
one plasmid by subcloning cassettes on Bgl2/Xbal 
fragments into BamHl/Xbal sites. 

The genes encoding a, p, and F14K (Fig. 10) were 
PCR amplified using primers with EcoRl sites at the 5' 

30 ends. The PCR products were digested with EcoRl , and 
subcloned into the EcoRl site of pHIL-D4B2 to generate 
pSW46 (Fig. 12), pSW47 (Fig. 13) and pSW48 (Fig. 14), 
respectively. The Bgl2/Xbal fragment from pSW47 
containing the p expression cassette was subcloned into 

35 the BamHl/Xbal sites of pSW46 to generate pSW4 9 

(Fig. 15), which contains expression cassettes for a 
and P. The Bgl2/Xbal fragment from pSW48 containing 
the P14K expression cassette was subcloned into the 
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BamHl/Xbal sites of pSW49 to generate pSWSO (Fig. 16), 
which contains expression cassettes for ot f p and F14K. 

EXAMPLE 19 

Construction of Pichia pasr.zris strain 
5 for expression of NRRT, -18f6fl NHasft 

P . pastoris straih GTS115 (his4) (Phillips 
Petroleum, Bartlesville, OK) was transformed with 
1-2 ng of Bgl2-linearized plasmid pSW4 9 or 1-2 ng of 
Bgl2-linearized plasmid pSW50 using the spheroplast 

10 transformation method as described 'Gregg et al . 

(1985) Mol. Cell. Biol. 5: 3376-3385). Cells were 
regenerated on plates without histidine for 3-4 d at 
30°C. All trans formants arise after integration of 
plasmid DNA into the chromosome. Chromosomal DNA was 

15 prepared from his + transf ormants and. subjected to PCR 
analysis with primers specific for a, (3 and P14K 
genes. An isolated pSW4 9 transf ormant positive for a 
and (3 genes, and an isolated pSWSO transf ormant 
positive for a, p and P14K genes, designated SW4 9 and 

20 SW50.2, respectively, were selected for further study, 
P. pastoris strain SW50.2 was deposited with ATCC and 
assigned accession number ATCC 74391. 

EXAMPLF. 9 0 

NRRL-18668 NHase activity in en^ neergri P. pastoris 
25 P. pastoris strains SW49 and SW50.2 were grown to 

A 600 of 2-10 in MGY (1.34% yeast nitrogen base without 
amino acids, 0.00004% biotin, 1% glycerol) with 
shaking at 30 °C. Cells are then pelleted and induced 
by resuspending in MM (1.34% yeast nitrogen base 
30 without amino acids, 0.00004% biotin, 0.5% methanol) 
and incubated with shaking at 30 °C for 1-4 d. Cells 
were harvested by centrif ugation and washed in PBS 
(0.1 M KH 2 P0 4 , pH 7.2). NHase activity was 
demonstrated by methacrylonitrile assay, in which 
35 cells were resuspended in PBS at A 600 of 0.6, and 

methacrylonitrile was added to a final concentration 
of 10 mM. After incubation with shaking at room 
temperature, conversion of methacrylonitrile to 
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methacrylamide by NHase was demonstrated by monitoring 
the increase in A 22 4 of the supernatant. Cells boiled 
before assay serve as a negative control. NHase 
activity was observed in SW50.2 which harbors 
5 expression cassettes for a, £ and ?14K, while SW49, 
which only harbors expression cassettes for a and 0 
showed negligible NHase activity. 

A 224 

rxn t i me, m i n SW5Q , ? r swso.? Hoi ^ 

0 0.260 0.360 0.110 

15 0.360 1.390 0.125 

10 Stereospecif ic NHase activity was also 

demonstrated in induced SW50.2 cells by using R-2-<4- 
chlorophenyl) -3-methylbutyronitrile (R-CPIN) or S-2- 
(4-chlorophenyl) -3-methy lbutyronitriie (S-CPIN) as 
substrate and and then analyzing for conversion to the 

15 corresponding amides (R-CPIAm and S-CPIAm, 
respectively) by HPLC. 

mM 

rsn t i me, h R-CPIN R-CPIAm s-cptn s-cptahi 
0 10 0 10 o 

48 10 0 5.5 4.5 

Bioconversion of adiponitrile (ADN) to 
20 5-cyanovaleramide (5-CVAm) was also demonstrated in 
permeabilized SW50.2 cells, and in SW50.2 cell 
extracts. Permeabilized cells were prepared by the 
addition of benzalkonium chloride (Lonza Baequat 
MB-50) to a 10% (wt) suspension of induced cells to 
25 yeild 1% (wt MB-50:wt cells). The suspension was then 
mixed on a nutator mixer for 60 min at room 
temperature, after which cells were washed by 
centrifugation 3 times with 50 mM phospahte buffer, pH 
7.0. Extracts were prepared by rapidly vortexing 
30 induced cells with 0 . 5 mm glass beads (BioSpec 

Products) in 50 mM KH 2 P0 4 , pH 7.0/1 mM EDTA/0.1 mM PMSF 
for 2 min. NHase activity was determined to be 
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34-38 U/g wet wt (permeabilized cells), and 35-56 U/g 
wet wt (cell extracts) . 

EXAMPLE 21 

Construction of plasmid for oppression of 
5 NRRL-18668 amidasft in F. . nnli 

The gene encoding NRRL-18688 amidase was PCR 
amplified using an upstream primer with a Hind3 site 
at the 5 T end and a downstream primer with an Xhol 
site at the 5' end. The PCR product was subcloned 
10 into the vector pET-21a<+) (Novagen, Madison, WI) 
between the Hind3 and Xhol sites to generate the 
expression plasmid pSW37 (Fig. 17) . 

EXAMPLE 22 
Construction of EL coli --rain for 
15 expression of NRRL-186 6S amidase 

E . coli strain BL21 (DE3) (Novagen, Madison, WI) 
was transformed with pSW37 using the calcium chloride 
procedure (Maniatis et al . (1989) Molecular Cloning: 
A Laboratory Manual) , and an isolated transformant was 
20 designated SW37, and deposited with ATCC and assigned 
accession number ATCC 98174. Induced SW37 shows 
production of amidase enzyme based on Coomassie Blue 
stained denaturing polyacrylamide gel electrophoresis 
of soluble cell extract. 
25 EXAMPLE ?^ 

NRRL-1866B amidase activity in engineered E . coli 
E. coli strain SW37 is grown in LB media at 30 °C 
to A 600 =0.5, at which time IPTG is added to 1 mM and 
incubation continued for 2 h. Cells are then pelleted 
30 and washed in PBS. Cells are incubated with 10 mM 

butyramide and conversion to butyric acid is monitored 
by HPLC. 

EXAMPLE ?A 
Construction of plasmid for expression 
35 of NRRL-18668 amidase a nd NHase in E . mil 

The entire 8.0 kb DNA fragment (shown in Fig. 10) 
was subcloned between the EcoRl and Xhol sites of the 
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vector pET-21(+) (Novagen, Madison, WI) to generate 
the plasmid pSW23 (Fig. 18) . 

EXAMPLE 7S 

Construction of E : noli .^rain for 
CP-express i on Of NRRL-1B66R amida.s^ . n d NHaso 
E. coli strain BL21 (DE3) (Novagen, Madison, WI ) 
was transformed with pSW23 using the calcium chloride 
procedure (Maniatis et al. (1989) Molecular Cloning: 
A Laboratory Manual), and an isolated transformant was 
designated SW23, and deposited with ATCC and assigned 
accession number ATCC 98175. Induced SW23 shows 
production of NHase enzyme and amidase enzyme based on 
Coomassie Blue stained denaturing polyacrylamide gel 
electrophoresis of soluble cell extract. 

EXAMPT.F, ?fi 
NRRL-18668 amidasp and NHase activity 

in enainfifirpd E . noli 
E. coli strain SW23 is grown in LB media at 30 °C 
to A €00 =0.5, at which time IPTG is added to 1 mM and 
incubation continued for 2 h. Cells are then pelleted 
and washed in PBS. Cells are incubated with 10 mM 
butyronitrile and conversion to butyric acid is 
monitored by HPLC. Stereospecif ic conversion of 
S-CPIN, relative to R-CPIN, to the corresponding acid 
(S-CPIAc) can also be monitored by HPLC. 
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SEQUENCE IjTfiTTN^ 
(1) GENERAL INFORMATION: 

(i) APPLICANT: 

<A) NAME: E. I. DU PONT DE NEMOURS AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY: WILMINGTON 

(D) STATE: DELAWARE 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE (ZIP) : 19898 

(G) TELEPHONE: 302-892-8112 

(H) TELEFAX: 302-773-0164 

(I) TELEX: 6717325 

(ii) TITLE OF INVENTION: NUCLEIC ACID FRAGMENTS 

ENCODING STEREOSPECIFIC 
NITRILE H YD RAT AS E AND 
AMID AS E ENZYMES AND 
RECOMBINANT ORGANISMS 
EXPRESSING THOSE ENZYMES 
USEFUL FOR THE 
PRODUCTION OF CHIRAL 
AMIDES AND ACIDS 

(iii) NUMBER OF SEQUENCES: 28 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: FLOPPY DISK 

<B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: MICROSOFT WINDOWS 3.1 

<D) SOFTWARE: MICROSOFT WORD 2.0C 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

<A) APPLICATION NUMBER: 60/004914 
(B) FILING DATE: OCTOBER 5, 19 95 

(vii) ATTORNEY /AGENT INFORMATION : 

(A) NAME: FLOYD, LINDA A. 

(B) REGISTRATION NO.: 33,692 

(C) REFERENCE /DOCKET NUMBER: CR-9677-A 
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(2) INFORMATION FOR SEQ ID NO.:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:l: 

Met Gly Gin Ser His Thr His Asp His His His Asp Gly Tyr Gin Ala 
15 10 is 

Pro Pro Glu Asp He Ala Leu Arg Val Lys Ala Leu Glu Ser Leu Leu 

20 25 30 

He Glu Lys Gly Leu Val Asp Pro Ala Ala Met Asp Leu Val Val Gin 
35 40 45 

Thr Tyr Glu His Lys Val Gly Pro Arg Asn Gly Ala Lys Val Val Ala 
50 55 60 

Lys Ala Trp Val Asp Pro Ala Tyr Lys Ala Arg Leu Leu Ala Asp Ala 
65 7 0 75 F 80 

Thr Ala Ala He Ala Glu Leu Gly Phe Ser Gly Val Gin Gly Glu Asp 

85 90 " 95 

Met Val He Leu Glu Asn Thr Pro Ala Val His Asn Val Phe Val Cvs 

100 105 no 

Thr Leu Cys Ser Cys Tyr Pro Trp Pro Thr Leu Gly Leu Pro Pro Ala 
115 120 125 

Trp Tyr Lys Ala Ala Ala Tyr Arg Ser Arg Met Val Ser Asp Pro Ara 
130 135 i 4 o ^ 

Gly Val Leu ,Ala Glu Phe Gly Leu Val He Pro Ala Asn Lys Glu He 
145 150 155 160 

Arg Val Trp Asp Thr Thr Ala Glu Leu Arg Tvr Met Val Leu Pro Glu 

165 170 ' 175 

Arg Pro Gly Thr Glu Ala Tyr Ser Glu Glu Gin Leu Ala Glu Leu Val 

180 185 190 

Thr Arg Asp Ser Met He Gly Thr Gly Leu Pro Thr Gin Pro Thr Pro 
195 200 205 

Ser His 
210 

(2) INFORMATION FOR SEQ ID NO.:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES S : unknown 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : No 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO . : 2 : 

Met Asn Gly lie His Asp Thr Gly Gly Ala His Gly Tyr Gly Pro Val 
1 5 10 15 

Tyr Arg Glu Pro Asn Glu Pro Val Phe Arg Tyr Asp Trp Glu Lys Thr 

20 25 30 

Val Met Ser Leu Leu Pro Ala Leu Leu Ala Asn Ala Asn Phe Asn Leu 
35 40 45 

Asp Glu Phe Arg His Ser lie Glu Arg Met Glv Pro Ala His Tyr Leu 
50 55 60 

Glu Gly Thr Tyr Tyr Glu His Trp Leu His Val Phe Glu Asn Leu Leu 
65 70 75 80 

Val Glu Lys Gly Val Leu Thr Ala Thr Glu Val Ala Thr Gly Lys Ala 

85 90 95 

Ala Ser Gly Lys Thr Ala Thr Arg Val Leu Thr Pro Ala lie Val Asp 

100 105 110 

Asp Ser Ser Ala Pro Gly Leu Leu Arg Pro Glv Gly Gly Phe Ser Phe 
115 120 " 125 

Phe Pro Val Gly Asp Lys Val Arg Val Leu Asn Lys Asn Pro Val Gly 
130 135 140 

His Thr Arg Met Pro Arg Tyr Thr Arg Ala Lys Trp Gly Gin Trp Ser 
145 150 155 160 

Ser Thr Met Val Cys Phe Val Thr Pro Asp Thr Ala Ala His Gly Lys 

165 170 175 

Gly Glu Gin Pro Gin His Val Tyr Thr Val Ser Phe Thr Ser Val Glu 

180 185 190 

* 

Leu Trp Gly Gin Asp Ala Ser Ser Pro Lys Asp Thr lie Arg Val Asp 
195 200 205 

Leu Trp Asp Asp Tyr Leu Glu Pro Ala 
210 215 

(2) INFORMATION FOR SEQ ID NO.: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) H YPOTHET I C AL : No 
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(xi) SEQUENCE DESCRIPTION: SEQ 


ID NO . : 3 : 






AT GGGGC AAT 




TP.APPAPPAT 


Ln^bnLibbb x 


ACCAGGCACC 


GCCCGAAGAC 


60 


ATCGCGCTGC 


GGGTCAAGGC 


PTT<^GAP/rPT 


\-l bcl 0>i 1 Lb 


AGAAAGGTCT 


TGTCGACCCA 


120 


GCGGCCATGG 


ACTTGGTCGT 


CP AAAPGT AT 


brlnv.AUAAbb 


TAGGCCCCCG 


AAACGGCGCC 


180 


AAAGTCGTGG 


CCAAGGPPTG 


GCSTHfiAPPPT 


1 AtAAbb 


CCCGTCTGCT 


GGCAGACGCA 


240 


ACTGCGGCAA 


TTP^PP-APPT 


bVjbL X1L1 — V. 


ubbb I AUAbb 


GCGAGGACAT 


GGTCATTCTG 


300 


GAAAACACCC 






bTTT G CACCT 


TGTGCTCTTG 


CTACCCATGG 


360 


CCGACGCTGG 


nrTTcrrrrr 


1 bLL x bo X AL 




CCTACCGGTC 


CCGCATGGTG 


420 


AGCGACCCGC 


\j x btj(j(j J. TCT 


CbCGGAGTTC 


GGCCTGGTGA 


TCCCCGCCAA 


CAAGGAAATC 


480 


CGCGTCTGGG 


ACACCACGGC 


CGAATTGCGC 


TACATGGTGC 


TGCCGGAACG 


GCCCGGAACT 


540 


GAAGCCTACA 


GCGAAGAACA 


ACTGGCCGAA 


CTCGTTACCC 


GCGATTCGAT 


GATCGGCACC 


600 


GGCCTGCCAA 


CCCAACCCAC 


CCCATCTCAT 


TAA 






633 


(2) 


INFORMATION 


FOR SEQ ID 


NO . : 4 : 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: No 





(xi) SEQUENCE DESCRIPTION: SEQ 


ID NO . : 4 : 






ATGAATGGCA 


TTCACGATAC 


TGGCGGAGCA 


CATGGTTATG 


GGCCGGTTTA 


CAGAGAACCG 


60 


AACGAACCCG 


TCTTTCGCTA 


CGACTGGGAA 


AAAACGGTCA 


TGTCCCTGCT 


CCCGGCCCTG 


120 


CTCGCCAACG 


CGAACTTCAA 


CCTCGATGAA 


TTTCGGCATT 


CGATCGAGCG 


AATGGGCCCG 


180 


GCCCACTATC 


TGGAGGGAAC 


CTACTACGAA 


CACTGGCTTC 


ATGTCTTTGA 


GAACCTGCTG 


240 


GTCGAGAAGG 


GTGTGCTCAC 


GGCCACGGAA 


GTCGCGACCG 


GCAAGGCTGC 


GTCTGGCAAG 


300 


ACGGCGACGC 


GCGTGCTGAC 


GCCGGCCATC 


GTGGACGACT 


CGTCAGCACC 


GGGGCTTCTG. 


360 


CGCCCGGGAG 


GAGGGTTCTC 


TTTTTTTCCT 


GTGGGGGACA 


AGGTTCGCGT 


CCTCAACAAG 


420 


AAC CCGGTGG 


GCCATACCCG 


CATGCCGCGC 


TACACGCGGG 


CAAAGTGGGG 


ACAGTGGTCA 


480 


TCGACCATGG 


TGTGTTTCGT 


GAC GCCGG AC 


ACCGCGGCAC 


ACGGAAAGGG 


CGAGCAGCCC 


540 


CAGCACGTTT 


ACACCGTGAG 


TTTCACGTCG 


GTCGAACTGT 


GGGGGCAAGA 


CGCTTCCTCG 


600 


CCGAAGGACA 


CGATTCGCGT 


CGACTTGTGG 


GATGACTACC 


TGGAGCCAGC 


GTGA 


654 
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(2) INFORMATION FOR SEQ ID NO.: 5: 

<i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO . : 5 : 

Gly Gin Ser His Thr His Asp His His His Aso Gly Tyr Gin Ala Pro 
1 5 10 15 

Pro Glu Asp lie Ala Leu Arg Val Lys Ala Leu Glu Ser Leu 

20 25 30 

(2) INFORMATION FOR SEQ ID NO . : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 13 amino acids 
(B) TYPE: amino acid 

< C ) STRANDEDNESS : unknown 

< D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 6: 

Asp Leu Val Val Gin Thr Tyr Glu His Lys Val Gly Pro 
15 10 

<2) INFORMATION FOR SEQ ID NO.:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 7: 

Asn Gly Ala Lys Val Val Ala Lys Ala Trp Val Asp Pro Ala Tyr Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO.:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: No 



(xi) SEQUENCE DESCRIPTION: SEQ ID N"0.:8: 

Asp Pro Arg Gly Val Leu Ala Glu Phe Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO.: 9: 

(i) . SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYP OTHET I CAL : No 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO . : 9 : 



Gly Leu Pro Thr Gin Pro Thr Pro Ser His 
15 io 

(2) INFORMATION FOR SEQ ID NO.:10: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 3 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: No 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 10: 



Met Asn Gly lie His Asp Thr Gly Gly Ala His Gly Tyr Gly Pro Val 
15 io 15 

Tyr Arg Glu Pro Asn Glu Pro Val Phe Arg Tvr Asp Trp Glu Lys Thr 

20 25 30 

Val Met Ser Leu Leu Pro Ala Leu Xaa Ala Asn Gly Asn Phe Asn Leu 
35 40 45 

Asp Glu Phe Arg His Ser lie Glu Arg Met Glv Pro Ala His Tyr Leu 
50 55 * 60 

Glu Gly Thr Tyr Tyr Glu His Trp Leu His Val Phe Glu Asn Leu Leu 
65 70 75 80 

Val Glu Lys 

(2) INFORMATION FOR SEQ ID NO.:ll: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
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in ) 



HYPOTHETICAL: 



NO 



(xi) 



SEQUENCE DESCRIPTION: 



SEQ ID MO. :11: 



Gly Glu His Pro Gin His Val Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO.: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO.: 12: 

Ser Phe Thr Ser Val Glu Leu Trp Gly Gin Asd Ala Ser Ser Pro Lys 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO.: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:13: 

Val Asp Leu Trp Asp Asp Tyr Leu Glu Pro Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO.: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: No 
(xi) SEQUENCE DESCRIPTION: SEQ ID HC.:14: 
GGAATTCGAY CAYCAYC AY G A 21 
(2) INFORMATION FOR SEQ ID NO.:15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 
< D ) TOPOLOGY : 1 ine a r 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: No 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO . : 1 5 : 

GGAATTCTTY TCCCARTCRT A 21 

(2) INFORMATION FOR SEQ ID NO . : 1 6 : 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: No 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:16: 



GAATTCGATC 


ACCATCACGA 


CGGGTACCAG 


GCACCGCCCG 


AAGACATCGC 


GCTGCGGGTC 


60 


AAGGCCTTGG 


AGTCTCTGCT 


GATCGAGAAA 


GGTCTTGTCG 


ACCCAGCGGC 


CATGGACTTG 


120 


GTCGTCCAAA 


CGTATGAACA 


CAAGGTAGGC 


CCCCGAAACG 


GCGCCAAAGT 


CGTGGCCAAG 


180 


GCCTGGGTGG 


ACCCTGCCTA 


CAAGGCCCGT 


CTGCT GGC AG 


ACGCAACTGC 


GGCAATTGCC 


240 


GAGCTGGGCT 


TCTCCGGGGT 


ACAGGGCGAG 


GACATGGTCA 


TTCTGGAAAA 


CACCCCCGCC 


300 


GTCCACAACG 


TCTTCGTTTG 


CACCTTGTGC 


TCTTGCTACC 


CATGGCCGAC 


GCTGGGCTTG 


360 


CCCCCTGCCT 


GGTACAAGGC 


CGCCGCCTAC 


CGGTCCCGCA 


TGGTGAGCGA 


CCCGCGTGGG 


420 


GTTCTCGCGG 


AGTTCGGCCT 


GGTGATCCCC 


GCCAACAAGG 


AAATCCGCGT 


CTGGGACACC 


480 


ACGGCCGAAT 


TGCGCTACAT 


GGTGCTGCCG 


GAACGGCCCG 


GAACTGAAGC 


CTACAGCGAA 


540 


GAACAACTGG 


CCGAACTCGT 


TACCCGCGAT 


TCGATGATCG 


GCACCGGCCT 


GCCAACCCAA 


600 


CCCACCCCAT 


CTCATTAAGG 


AGTTCGTCAT 


GAATGGCATT 


CACGATACTG 


GCGGAGCACA 


660 


TGGTTATGGG 


CCGGTTTACA 


GAGAACC GAA 


CGAACCCGTC 


TTTCGCTACG 


ACTGGGAAAA 


720 


GAATTC 












726 


(2) 


INFORMATION 


FOR SEQ ID 


NO. :17: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
< D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: No 
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<xi) SEQUENCE DESCRIPTION : SEQ ID NO.:17: 

CGGGAGCGCA ATCTGCAAGG TGGCATTGGC CTTCAGTGTC GATGCCGAGT TGAAGTCGCT 60 

GTACCCCTTT TTTCAACCAC ACCAGGAGAA CCGCACCATG GGGCAATCAC ACACGCATGA 120 

CCACCATCAC GACGGGTACC AGGCACCGCC CGAAGACATC GCGCTGCGGG TCAAGGCCTT 180 

GGAGTCTCTG CTGATCGAGA AAGGT CTTGT CGACCCAGCG GCCATGGACT TGGTCGTCCA 240 

AACGTATGAA CACAAGGTAG GCCCCCGAAA CGGCGCCAAA GTCGTGGCCA AGGCCTGGGT 300 

GGACCCTGCC TACAAGGCCC GTCTGCTGGC AGACGCAACT GCGGCAATTG CCGAGCTGGG 360 

CTTCTCCGGG GTACAGGGCG AGGACATGGT CATTCTGGAA AACACCCCCG CCGTCCACAA 4 20 

CGTCTTCGTT TGCACCTTGT GCTCTTGCTA CCCATGGCCG ACGCTSGGCT TGCCCCCTGC 4 80 

CTGGTACAAG GCCGCCGCCT ACCGGTCCCG CATGGTGAGC GACCCGCGTG GGGTTCTCGC 54 0 

GGAGTTCGGC CTGGTGATCC CCGCCAACAA GGAAATCCGC GTCTGGGACA CCACGGCCGA 600 

ATTGCGCTAC ATGGTGCTGC CGGAACGGCC CGGAACTGAA GCCTACAGCG AAGAACAACT 660 

GGCCGAACTC GTTACCCGCG ATTCGATGAT CGGCACCGGC CTGCCAACCC AACCCACCCC 720 

ATCTCATTAA GGAGTTCGTC ATGAATGGCA TTC AC GAT AC TGGCGGAGCA CATGGTTATG 7 80 

GGCCGGTTTA CAGAGAACCG AACGAACCCG TCTTTCGCTA CGACTGGGAA AAAACGGTCA 840 

TGTCCCTGCT CCCGGCCCTG CTCGCCAACG CGAACTTCAA CCTCGATGAA TTTCGGCATT 900 

CGATCGAGCG AATGGGCCCG GCCCACTATC TGGAGGGAAC CTACTACGAA CACTGGCTTC 560 

ATGTCTTTGA GAACCTGCTG GTCGAGAAGG GTGTGCTCAC GGCCACGGAA GTCGCGACCG 1020 

GCAAGGCTGC GTCTGGCAAG ACGGCGACGC GCGTGCTGAC GCCGGCCATC GTGGACGACT 1080 

CGTCAGCACC GGGGCTTCTG CGCCCGGGAG GAGGGTTCTC TTTTTTTCCT GTGGGGGACA 1140 

AGGTTCGCGT CCTCAACAAG AACCCGGTGG GCCATACCCG CATGCCGCGC TACACGCGGG 1200 

CAAAGTGGGG ACAGTGGTCA TCGACCATGG TGTGTTTCGT GACGCCGGAC ACCGCGGCAC 1260 

ACGGAAAGGG CGAGCAGCCC CAGCACGTTT ACACCGTGAG TTTCACGTCG GTCGAACTGT 1320 

GGGGGCAAGA CGCTTCCTCG CCGAAGGACA CGATTCGCGT CGACTTGTGG GATGACTACC 1380 

TGGAGCCAGC GTGATCATGA AAGACGAACG GTTTCCATTG CCAGAGGGTT CGCTGAAGGA 14 40 
(2) INFORMATION FOR SEQ ID NO.: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: No 
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(xi) SEQUENCE DESCRIPTION: SEQ ID IO.:18: 
GAGGAATTCA TGGCCATTAC TCGCCCTACC C 31 
(2) INFORMATION FOR SEQ ID NO.:19: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:19: 
GTCGAATTCT CAGAGCGTGC GCCAGTCCAC C 31 
(2) INFORMATION FOR SEQ ID NO.:20: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: No 
(xi) SEQUENCE DESCRIPTION: SEQ ID NC.:20: 



ATGGCCATTA 


CTCGCCCTAC 


CCTCGACCAG 


GTTTTAGACA 


TCCGAACCCA 


GTTGCACATG 


60 


CAACTGACGC 


ACGAACAGGC 


AGCGTCCTAC 


CTGGAACTGA 


TGCAACCGAG 


TTTCGACGCC 


120 


TACGACCTGG 


TCGACGAACT 


GGCTGATTTC 


GTTCCGCCAA 


TACGCTACGA 


CCGCAGTTCA 


180 


GGCTATCGCC 


ATCGGCCATC 


GGCCAAGGAA 


AACCCTCTGA 


ACGCCTGGTA 


CTACCGAACA 


240 


GAAGTGAATG 


GTGCCCGCGA 


AGGCCTGCTG 


GCGGGCAAAA 


CCGTCGCGCT 


CAAAGATAAT 


300 


ATCTCCCTGG 


CAGGCGTCCC 


CATGATGAAC 


GGCGCAGCGC 


CGTTGGAAGG 


CTTCGTCCCG 


360 


GGGTTCGATG 


CCACGGTGGT 


CACCCGCTTG 


CTCGATGCGG 


GGGCGACCAT 


TCTCGGCAAA 


420 


GCCACCTGCG 


AGC ACT ACT G 


C CTTTCAGG A 


GGCAGCCACA 


CCTCCGATCC 


AGCCCCGGTG 


480 


CACAACCCAC 


ATCGCCACGG 


TTATGCCTCT 


GGCGGTTCCT 


CATCAGGCAG 


CGCGGCATTG 


540 


GTTGCGTCCG 


GTGAGGTGGA 


CATCGCCGTG 


GGCGGCGATC 


AAGGCGGCTC 


CATTCGGATC 


600 


CCGTCGGCCT 


TCTGCGGTAC 


CTACGGCATG 


AAGCCCACCC 


ACGGCCTGGT 


GCCCTACACC 


660 


GGCGTCATGG 


CGATTGAAGC 


CACGATCGAT 


CATGTCGGCC 


CCATCACCGG 


TAACGTGCGC 


720 


GACAACGCGC 


TGATGCTGCA 


GGCAATGGCC 


GGTGCAGACG 


GACTCGACCC 


GCGCCAGGCG 


780 


GCGCCTCAGG 


TCGATGACTA 


TTGCAGTTAC 


CTGGAAAAAG 


GCGTGAGCGG 


ACTCAGAATC 


840 


GGGGTGTTGC 


AAGAGGGATT 


CGCGCTTGCT 


AACCAGGACC 


CTCGCGTGGC 


GGACAAAGTG 


900 
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CGCGACGCCA 


TCGCCCGACT 


CGAGGCGTTG 


GGCGCTCATG 


TCGAGCCGGT 


CTCCATTCCC 


960 


GAGCACAACC 


TGGCAGGGTT 


GTTGTGGCAC 


CCCATCGGTT 


GCGAAGGCTT 


GACCATGCAG 


1020 


ATGATGCATG 


GCAACGGCGC 


AGGCTTTAAC 


TGGAAAGGAC 


TTTAC GATGT 


CGGCCTGCTG 


1080 


GACAAACAAG 


CCAGCTGGCG 


CGACGACGCA 


GACCAATTAT 


CCGCGTCGCT 


CAAGCTCTGC 


1140 


ATGTTCGTCG 


GCCAATACGG 


CCTGTCGCGC 


TACAACGGAC 


GCTACTACGC 


CAAGGCCCAG 


1200 


AACCTTGCAC 


GCTTTGCCCG 


GCAGGGATAC 


GACAAAGCGC 


TGCAAACCTA 


TGACCTGCTG 


1260 


GTGATGCCGA 


CCACGCCCAT 


CACGGCCCAA 


CCCCACCCGC 


CAGCGAACTG 


CTCGATCACG 


1320 


GAGTACGTGG 


CTCGCGCGTT 


GGAAATGATC 


GGCAATACCG 


CGCCACAGGA 


CATCACCGGG 


1380 


CATCCGGCCA 


TGTCGATTCC 


GTGTGGCCTG 


CTGGACGGCC 


TGCCCGTCGG 


GCTGATGCTG 


1440 


GTCGCAAAAC 


ACTACGCCGA 


GGGCACGATT 


TACCAAGCGG 


CGGCGGCGTT 


TGAAGCCTCG 


1500 


GTGGACTGGC 


GCACGCTCTG 


A 








1521 



(2) INFORMATION FOR SEQ ID NO.:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 
(B) STRAIN: P14K 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:21: 

ATGGCCCTGT GTTTGACGAG CCTTGGCAGT CCCAGGCGTT TGCCTTGGTG GTCAGCATGC 60 

ACAAGGCCGG TCTCTTTCAG TGGAAAGACT GGGCCGAGAC CTTCACCGCC GAAATCGACG 120 

CTTCCCCGCT CTGCCGGCGA AAGCGTCAAC GACACCTACT ACCGGCAATG GGTGTCGGCG 180 

CTGGAAAAGT TGGTGGCGTC GCTGGGGCTT GTGACGGGTG GAGAC GTCAA CTCGCGCGCA 240 

CAGGAGTGGA AACAGGCCCA CCTCAACACC CCACATGGGC ACCCGATCCT GCTGGCCCAT 300 

GCGCTTTGCC CGCCAGCGAT CGACCCCAAG CACAAGCACG AGCCACAACG CTCACCGATC 360 

AAGGT CGTTG CCGCAATGGC TTGA 384 
(2) INFORMATION FOR SEQ ID NO.:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: P14K 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:22: 

Met Ala Leu Cys Leu Thr Ser Leu Gly Ser Pro Arg Arg Leu Pro Trp 

5 10 15 

Trp Ser Ala Cys Thr Arg Pro Val Ser Phe Ser Gly Lys Thr Gly Pro 

25 30 

Arg Pro Ser Pro Pro Lys Ser Thr Leu Pro Arg Ser Ala Gly Glu Ser 

45 

Val Asn Asp Thr Tyr Tyr Arg Gin Trp Val Ser Ala Leu Glu Lys Leu 

->U 55 



35 40 



60 



Val Ala Ser Leu Gly Leu Val Thr Gly Gly Asp Val Asn Ser Arg Ala 
65 70 75 * 80 

Gin Glu Trp Lys Gin Ala His Leu Asn Thr Pro His Gly His Pro He 

85 90 95 

Leu Leu Ala His Ala Leu Cys Pro Pro Ala He Asp Pro Lys His Lys 

100 105 110 

His Glu Pro Gin Arg Ser Pro He Lys Val Val Ala Ala Met Ala 
115 120 125 

(2) INFORMATION FOR SEQ ID NO.:23: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 21 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:23: 

GATGCGGCCA TAGGCGAATT C 

(2) INFORMATION FOR SEQ ID NO.: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:24: 
ACCGCCACCG ACTACCTGCA G 



21 



21 
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(2) INFORMATION FOR SEQ ID NO.: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 25: 

GTCAGCCTGA GCAATCTGCA G 

(2) INFORMATION FOR SEQ ID NO.: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:26: 

GAATTCGGAA AAAATCGTAC G 

(2) INFORMATION FOR SEQ ID NO.:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
( D > TOPOLOGY : 1 i ne a r 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: AMI DAS E 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 27: 



21 



21 



ATGAGTTCGC 


TAACCCGCCT 


CACCCTCGCG 


CAAGTTGCGC 


AGAAACTTAA 


GGCACGGGAA 


60 


GTCTCCGCCG 


TTGAAGTTCT 


GGACGCCTGT 


CTGACGCAGG 


TGCGCTCCAC 


CGAAAAACAG 


120 


ATCAGTGCGT 


ACGTGTGCGT 


GCTGGAGGAT 


CAGGCCCGTG 


CAGCAGCCCA 


CGCAACTGAC 


180 


GCCGACATCC 


GCGGGCGCTG 


GAAAGGCCCG 


CTGCATGGCG 


TGCC7GTAGC 


GGTCAAGGAC 


240 


TTAT AC G AC A 


TCGCTGGCGT 


ACCCACCACG 


GCATCGTCGC 


CAGCGCACGA 


ATTGGACGCG 


300 


CAGCAAGACC 


CGGCTAGAGT 


CCGGCGCTTA 


CAAGACGCAG 


GTGCCGTTAT 


CCTTGGCAAG 


360 


ACCCATACGC 


ACGAATTCGC 


CTATGGCCGC 


ATCACTCCGA 


AGTCGCGCAA 


CCCCAGGGAC 


420 


CCGGGAAGAA 


CACCGGGTGG 


CTCCAGCGGC 


GGCTCGGCGG 


CCACGGTCGC 


AGCCTGCTGC 


480 


GTCTACTTGG 


CGACCGGCAC 


CG AC AC CGGT 


GGATCCGTTC 


GCATCCCTTC 


GTCGATGTGC 


540 
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AACACCGTAG 


GCCTGAAGCA 


ACCTACGGTC 


GGCCGCGTGC 


ACGGTGCCGG 


TGTGAGTTCA 


600 


CTTTCCTGGA 


GCCTGGACCA 


TCCAGGCCCG 


ATCACGCGCA 


CCGTGGAAGA 


CACGGCGCTC 


660 


ATGCTTCAGG 


TGATGGCTGG 


CTTCGATCCA 


GCCGACCCGC 


GGTCGTTGGA 


TGAGCCGGTG 


720 


CCCAGCTATG 


CCGAAGGGCT 


CGGCCAAGGC 


GTGAAAGGCC 


TGCGCTGGGG 


TGTGCCGAAG 


760 


AACTACTTCT 


TCGACCGCGT 


GGACCCGGAA 


GTTGAAAGTG 


CGGTTCGTGC 


CGCCATCGAT 


840 


CAACTGAAAG 


AGCTGGGCGC 


CGAACTGGTG 


GAAGTCGAAG 


TGCCCATGGC 


CGAGCAGATC 


900 


ATCCCGGTGA 


AGTTCGGGAT 


CATGCTACCC 


GAAGCCAGCG 


CCTACCACCG 


CACGATGCTG 


960 


CGCGAGTCAC 


CCGAGCTCTA 


CACCGCCGAT 


GTCCGCATAC 


TGCTGGAACT 


CGGAGATCTA 


1020 


GTCACCGCCA 


CCGACTACCT 


GCAGGCGCAG 


CGCGTCCGTA 


CGCTGATGCA 


GCGCGCGGTG 


1080 


GCC GAGATGT 


ACCAGCGCAT 


CGATGTGCTG 


ATCGCACCCA 


CACTGCCCAT 


CCCGGCTGCT 


1140 


CGCAGCGGGG 


AGGAGGTCCA 


CACATGGCCG 


GACGGCACGG 


TAGAGGCGTT 


GGTCATGGCC 


1200 


TAT AC GCGCT 








LLAwLrCTbAA 


CCTGCCCTGT 


1260 


GGTTTCTCCA 


AGGATGGGTT 


GCGATCGGCA 


TGCAGATCAG 


GCCGGCCGCT 


GGACGAGAAG 


1320 


ACCCTGCTGC 


GTGCTGGGCT 


GGCCTACGAG 


AAAGCCACGA 


CCTGGCACCA 


GCGTCATCCG 


13B0 


GAACTGATCG 


GAGCGGGCTG 


A 








1401 



(2) INFORMATION FOR SEQ ID NO.: 28: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 4 66 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 

(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

/ 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: AMIDASE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 28: 

Met Ser Ser Leu Thr Arg Leu Thr Leu Ala Gin Val Ala Gin Lys Leu 
15 10 15 

Lys Ala Arg Glu Val Ser Ala Val Glu Val Leu Asp Ala Cys Leu Thr 

20 25 30 

Gin Val Arg Ser Thr Glu Lys Gin lie Ser Ala Tyr Val Cys Val Leu 
35 40 45 

Glu Asp Gin Ala Arg Ala Ala Ala His Ala Thr Asp Ala Asp He Ara 

50 55 60 

Gly Arg Trp Lys Gly Pro Leu His Gly Val Pro Val Ala Val Lys Asp 
65 70 75 80 
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Leu Tyr Asp lie 



Glu Leu Asp Ala 

100 

Ala Gly Ala Val 
115 

Gly Arg lie Thr 
130 

Pro Gly Gly Ser 
145 

Val Tyr Leu Ala 



Ser Ser Met Cys 

180 

Val His Gly Ala 
195 

Gly Pro lie Thr 
210 

Met Ala Gly Phe 
225 

Pro Ser Tyr Ala 



Gly Val Pro Lys 

260 

Ser Ala Val Arg 
275 

Leu Val Glu Val 
290 

Phe Gly lie Met 
305 

Arg Glu Ser Pro 



Leu Gly Asp Leu 

340 

Arg Thr Leu Met 
355 

Val Leu lie Ala 
370 

Glu Val His Thr 
385 

Tyr Thr Arg Phe 



Ala Gly Val Pro 
85 

Gin Gin Asp Pro 



lie Leu Gly Lys 

120 

Pro Lys Ser Arg 
135 

Ser Gly Gly Ser 
150 

Thr Gly Thr Asp 
165 

Asn Thr Val Gly 



Gly Val Ser Ser 

200 

Arg Thr Val Glu 
215 

Asp Pro Ala Asp 
230 

Glu Gly Leu Gly 
245 

Asn Tyr Phe Phe 



Ala Ala lie Asp 

280 

Glu Val Pro Met 
295 

Leu Pro Glu Ala 
310 

Glu Leu Tyr Thr 
325 

Val Thr Ala Thr 



Gin Arg Ala Val 

360 

Pro Thr Leu Pro 
375 

Trp Pro Asp Gly 
390 

Thr Ser Phe Gly 
405 



Thr Thr Ala Ser 
90 

Ala Arg Val Arg 
105 

Thr His Thr His 



Asn Pro Arc Asp 

140 

Ala Ala Thr Val 
155 

Thr Gly Gly Ser 
170 

Leu Lys Gin Pro 
185 

Leu Ser Trp Ser 



Asp Thr Ala Leu 

220 

Pro Arg Ser Leu 
235 

Gin Gly Val Lys 
250 

Asp Arg Val Asp 
265 

Gin Leu Lys Glu 



Ala Glu Gin lie 

300 

Ser Ala Tyr His 

315 

Ala Asp Val Arg 
330 



Asp Tyr Leu Gin 
345 

Ala Glu Met Tyr 



He Pro Ala Ala 

380 

Thr Val Glu Ala 
395 

Asn Val Thr Gly 
410 



Ser Pro Ala His 
95 

Arg Leu Gin Asp 
110 

Glu Phe Ala Tyr 
125 

Pro Gly Arg Thr 



Ala Ala Cys Cys 

160 

Val Arg He Pro 
175 

Thr Val Gly Arg 
190 

Leu Asp His Pro 
205 

Met Leu Gin Val 



Asp Glu Pro Val 

240 

Gly Leu Arg Trp 
255 

Pro Glu Val Glu 
270 

Leu Gly Ala Glu 
285 

He Pro Val Lys 



Arg Thr Met Leu 

320 

lie Leu Leu Glu 

335 

Ala Gin Arg Val 
350 

Gin Arg He Asp 
365 

Arg Ser Gly Glu 



Leu Val Met Ala 

400 

Leu Pro Thr Leu 
415 
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Asn Leu Pro Cys Gly Phe Ser Lys Asp Gly Lev Arg Ser Ala Cys Arg 

420 425 430 

Ser Gly Arg Pro Leu Asp Glu Lys Thr Leu Leu Arg Ala Gly Leu Ala 
435 440 445 

Tyr Glu Lys Ala Thr Thr Trp His Gin Arg His Pro Glu Leu He Gly 
450 455 460 

Ala Gly 
465 
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WHAT IS CLAIMED IS: 

1. A nucleic acid fragment encoding the a 
subunit of a stereospecif ic nit rile hydratase enzyme, 
said nucleic acid fragment having at least a 64% base 

5 homology with the a subunit coding region of the 

Rhodococcus rhodochrous Jl L-NHase gene and said enzyme 
capable of catalyzing the hydrolysis of racemic aryl-2- 
alkane nitriles to the corresponding R- or S- amides. 

2. A nucleic acid fragment encoding the P 

10 subunit of a stereospecif ic nitrile hydratase enzyme, 
said nucleic acid fragment having at least a 52% base 
homology with the P subunit coding region of the 
Rhodococcus rhodochrous Jl L-NHase gene and said enzyme 
capable of catalyzing the hydrolysis of racemic aryl-2- 

15 alkane nitriles to the corresponding R- or S- amides. 

3. A nucleic acid fragment comprising the 
nucleic acid fragments of Claims 1 and 2 encoding a 
stereospecif ic nitrile hydratase enzyme, said enzyme 
capable of catalyzing the hydrolysis of racemic aryl-2- 

20 alkane nitriles to the corresponding R- or S- amides. 

4. A nucleic acid fragment encoding the a 
subunit of a stereospecif ic nitrile hydratase enzyme, 
said nucleic acid fragment having the nucleotide 
sequence as represented in SEQ ID NO . : 3 and said enzyme 

2 5 capable of catalyzing the hydrolysis of racemic aryl-2- 
alkane nitriles to the corresponding R— or S- amides. 

5 . A nucleic acid fragment encoding the P 
subunit of a stereospecif ic nitrile hydratase enzyme, 
said nucleic acid fragment having the nucleotide 

30 sequence as represented in SEQ ID NC.:4 and said enzyme 
capable of catalyzing the hydrolysis of racemic aryl-2- 
alkane nitriles to the corresponding R- or S- amides. 

6. A nucleic acid fragment encoding both the a 
and P subunits of a stereospecif ic nitrile hydratase 

35 enzyme, said nucleic acid fragment having the 

nucleotide sequence as represented in SEQ ID NO.: 17 and 
said enzyme capable of catalyzing the hydrolysis of 
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racemic aryl-2-alkane nitriles to the corresponding R- 
or S- amides . 

7. A polypeptide a subunit of a stereospecific 
nitrile hydratase enzyme, said a subunit having the 
amino acid sequence as represented in SEQ ID NO.:l and 
said enzyme catalyzing the hydrolysis of racemic 
aryl-2-alkane nitriles to the corresponding R- or S- 
amides . 

8. A polypeptide (3 subunit of a stereospecific 
nitrile hydratase enzyme, said P subunit having the 
amino acid sequence as represented in SEQ ID NO . : 2 and 
said enzyme catalyzing the hydrolysis of racemic 
aryl-2-alkane nitriles to the corresponding R- or S- 
amides . 

15 9. A stereospecific nitrile hydratase enzyme 

comprising 1) the polypeptide a subunit of Claim 7 and 
2) the polypeptide p subunit of Claim 8 in proper 
conformation such that said enzyme catalyzes the 
hydrolysis of racemic aryl-2-alkane nitriles to the 

20 corresponding R— or S- amides. 

10. A 6.5 kb nucleic acid fragment encoding a 
nitrile hydratase enzyme and the accessory nucleic acid 
fragments necessary for the enzymes ? s active expression 
as characterized by the restriction map shown in 

25 Figure 2. 

11. 'A recombinant expression vector comprising 
the 6.5 kb nucleic acid fragment of Claim 10, said 
vector capable of transforming a suitable host cell for 
the expression of active stereospecific nitrile 
hydratase as characterized by the restriction map shown 
in Figure 3 . 

12. A transformed microbial host cell comprising 
the nucleic acid fragment of Claim 6 wherein said host 
cell expresses active nitrile hydratase enzyme capable 

35 of catalyzing the hydrolysis of racemic aryl-2 nitriles 
to the corresponding R- or S- amides . 

13. A transformed microbial host cell comprising 
the 6.5 kb nucleic acid fragment of Claim 10 wherein 
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said host cell expresses active nitrile hydratase 
enzyme capable of catalyzing the hydrolysis of racemic 
aryl-2 nitriles to the corresponding R- or S- amides, 

14. The transformed host cell of Claims 12 or 13 
5 wherein said host cell is selected from the group 

consisting of bacteria, yeast, and filamentous fungi. 

15. The transformed host cell of Claim 14 wherein 
said host cell is a bacterium selected from the group 
consisting of the genera Escherichia, Pseudomonas , 

10 Rhodococcus, Aclnetobacter , Bacillus r and Streptomyces . 

16. A transformed host cell of Claim 14 wherein 
said host cell is a yeast selected from the group 
consisting of the genera Plchla, Hansenula, and 

Sa ccharomyces . 

15 17. A transformed host cell of Claim 14 wherein 

said host cell is a filamentous fungus selected from 
the group consisting of the genera Aspergillus, 
Neurospora, and Penlcllllum. 

18. A transformed host cell of Claim 15 wherein 
20 said host cell is Escherichia coll. 

19. An expression vector comprising 1) a 5.0 kb 
nucleic acid fragment of the 6.5 kb nucleic acid 
fragment of Claim 10, and 2) a nucleic acid fragment 
having the nucleic acid sequence as given in SEQ ID 

25 NO. :20, wherein said nucleic acid fragment encodes an 
amidase enzyme, and wherein said expression vector is 
capable of transforming suitable host cells for the co- 
expression of active stereospecif ic nitrile hydratase 
and amidase. 

30 20. A transformed host cell comprising the vector 

of Claim 19. 

21. A transformed host cell of Claim 20 wherein 
said host is selected from the group consisting of the 
genera Escherichia, Pseudomonas, Rhodococcus, 
35 Aclnetobacter, Bacillus, Streptomyces, Hansenula, 

Saccharomyces , Plchla, Aspergillus, Neurospora, and 
Penlcllllum . 
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2. A method for converting a nitrile of the 



formula 



R1 

I 



: 



A i , 

R 2 
I 

(R.S) 



CN 



wherein : 



A is selected from the group consisting of 





A-1 



A-2 




A-3 





CI 



A-4 



A-5 




A-6 



H 3 CO 




A -7 




A-8 



A-9 
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and 




A-1 1 



R 1 is Ci-C^ alkyl; 
R 2 is H or OH; 

R3 is H, Cl f OCF 2 H, <CH 3 ) 2 CHCH 2 , H 2 C=C (CH 3 ) CH 2 NH, 







O 
II 

o 



or 




and 



R 4 is CI or F; 
5 to the corresponding amide comprising 

1) contacting said nitrile with the 
transformed host cell comprising a nucleic acid 
fragment having the nucleotide sequence represented by 
SEQ ID NO.:17 that stereospecif ically converts the 

10 racemic nitrile to the corresponding enantiomeric R- or 
S-amide, the host cell selected from the group 
consisting of Escherichia, Pseudomonas, Rhodococcus, 
Aclnetobacter, Bacillus, Streptomyces , Hansenula, 
Saccharomyces, Pichla, Aspergillus, Neurospora, and 

15 Penlcillium, 

2) recovering the product of Step 1) . 
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23. A method for converting a nitrile of the 
formula 



1 



A — C CN 



? 



wherein : 

A is selected from the group consisting of 




A-1 



A-2 




CI 



A<3 




A-4 




CI 



A-5 




A-6 



H 3 CO 




CH 
I 

N 




A-7 




H 0 C 




A-8 



A-9 
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F 

A-10 A-11 

R 1 is C x -C 4 alkyl; 
R 2 is H or OH; 

R 3 is H r CI, OCF 2 H, <CH 3 ) 2 CHCH 2 , H 2 C=C (CH 3 ) CH 2 NH, 




; and 



R 4 is CI or F; 

5 to the corresponding enantiomeric (R) or (S) -carboxylic 
acid comprising 

1) contacting the nitrile with the 
transformed host of Claim 20 that stereospecif ically 
converts the racemic nitrile to the corresponding 

10 enantiomeric (R) - or (S) -carboxylic acid, the host cell 
selected from the group consisting of Escherichia, 
Pseudomonas, Rhodococcus, Acinetobacter , Bacillus, 
Streptomyces, Hansenula, Saccharomycss r Pichia, 
Aspergillus, Neurospora, and Penicillium, and 

15 ' 2) recovering the product of Step 1) . 

24. A nucleic acid fragment encoding the a and P 
subunits of a stereospecif ic nitrile hydratase enzyme, 
said portion of the nucleic acid fragment encoding the 
a subunit having at least a 64% base homology to the 

20 Rhodochrous Jl L-NHase gene and said portion of the 

nucleic acid fragment encoding the (5 subunit having at 
least a 52% base homology to the Rhodochrous Jl L-NHase 
gene, said enzyme catalyzing the hydrolysis of racemic 
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aryl-2-alkane nitriles to the corresponding R- or S- 
amides . 

25. The polypeptide encoded by any one of the 
nucleic acid fragments of Claims 1, 2, 6, or 24. 
5 26. A transformed Escherichia coli comprising 

pSW2 and identified as ATCC 69888. 

27. A transformed Escherichia coli comprising 
pSW17 and identified as ATCC 69889. 

28. A transformed host cell comprising the 

10 nucleic acid fragments of Claims 1 and 2, wherein said 
transformed host expresses an active nitrile hydratase . 

29. A nucleic acid fragment corresponding to the 
P14K region of the 6.5 kb fragment as shown in 
Figure 10 wherein said fragment encodes a polypeptide 

15 having an amino acid sequence as represented in SEQ ID 
NO . : 22 . 

30. A nucleic acid fragment of Claim 29 where 
said fragment has the base sequence as represented in 
SEQ ID NO. :21. 

20 32. A P14K polypeptide encoded by the nucleic, 

acid fragment of Claim 29 and having the amino acid 
sequence as represented in SEQ ID N0.:22, wherein the 
expression of said polypeptide is required for the 
bioactivity of expressed Pseudomonas putida NRRL-18668 

25 stereospecif ic nitrile hydratase capable of catalyzing 
the hydrolysis of racemic aryl-2-alkane nitriles to the 
corresponding R- or S- amides. 

33. A nucleic acid fragment encoding a 18668 
amidase having an amino acid sequence as represented in 
SEQ ID NO.:28, wherein said amino acid sequence may 
encompass amino acid substitutions, deletions or 
additions that do not alter the function of said 
amidase . 

34. A nucleic acid fragment of Claim 33 where 
said fragment has the base sequence as given by SEQ ID 
NO.:27. 

35. An 18668 amidase polypeptide encoded by the 
nucleic acid fragment of Claim 33 and having the amino 
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35 
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10 



15 



20 



acid sequence as represented in SEQ ID NO.:28 r wherein 
said amino acid sequence may encompass amino acid 
substitutions, deletions or additions that do not alter 
the function of said amidase, 

36. A transformed microbial host containing the 
nucleic acid fragments of Claim 6 and Claim 29 said 
host capable of catalyzing the hydrolysis of racemic 
aryl-2-alkane nitriles to the corresponding R- or S- 
amides . 

37. A transformed host according to Claim 36 
wherein said host is a member of the genus Plchia . 

38. A transformed microbial host containing the 
nucleic acid fragment of Claim 6, Claim 29 and Claim 33 
said host capable of catalyzing the hydrolysis of 
racemic ary 1-2 -alkane nitriles to the corresponding R- 
or S- amides and the conversion of said amides to the 
corresponding enantiomeric (R) or (S) -carboxylic acids. 

39. A transformed Plchia pastcris comprising 
pSWSO and identified as ATCC 74391. 

41. A transformed E . coll comprising pSW37 and 
identified as ATCC 98174. 

42. A transformed E. coll comprising pSW23 and 
identified as ATCC 98175. 

43. A method for converting a nitrile of the 



formula 



A— 




CN 



<R,S) 



wherein : 



A is selected from the group consisting of: 
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H 

CI N 




F 

A-10 A-11 



R 1 is C x -C4 alkyl; 
R 2 is H or OH; 

R 3 is H, CI, OCF 2 H, (CH 3 ) 2 CHCH 2r H 2 OC (CH 3 ) CH 2 NH , 

©5- ■ (Sr- . 



70 



BNSDOCID: <WO. 



9712964A2J_> 



WO 97/12964 



PCT/US96/15969 




; and 



R 4 is CI or F; 
to* the corresponding amide comprisina 

1) contacting said nitrile with a transformed 
host cell of Claim 36 comprising the nucleic acid 

5 fragments of Claim 6 and Claim 29, the host cell 

capable of stereospecif ically converting the racemic 
nitrile to the corresponding enantiomeric R- or 
S-amide, the host cell selected from the group 
consisting of Escherichia, Pseudomonas, Rhodococcus, 
10 Acinetobacter, Bacillus, Streptomyces , Hansenula, 

Saccharomyces , Pichia, Aspergillus r Neurospora, and 
Penicillium, 

2) recovering the product of Step 1) . 
44. A method for converting a nitrile of the 

15 formula 




I 

(R.S) 

wherein : 

A is selected from the group consisting of: 

H 




CI 

A-1 A-2 A-3 
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F 

A-10 A-11 



R 1 is Ci-C^ alkyl; 
R 2 is H or OH; 

R 3 is H, CI, OCF 2 H, <CH 3 ) 2 CHCH 2 , H 2 C=C (CH 3 ) CH 2 NH , 

©6- &~r 




; and 



R 4 is CI or F; 

to the corresponding enantiomeric (R) or (S) -carboxylic 
acid comprising 
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1) contactina the nitrile vith the 
transformed host cell of Claim 39 that stereo- 
specifically converts the racemic nitrile to the 
corresponding enantiomeric (R)- or ( 3 ) -carboxylic acid, 

5 the host cell selected from the group consisting of 
Escherichia , Pseudomonas , Rhodococcus , Acinetohacter, 
Bacillus , Streptomyces, Hansenula r Saccharomyces , 
Pichla, Aspergillus , Neurospora , and Penicillium, and 

2) recovering the product of Step 1) . 
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SEQ ID NO: 17 : 



CGGGAGCGCA 


k ATCTGCAAGG 


i TGGCATTGGC 


: CTTCAGTGTC 


GATGCCGAGT 


TGAAGTCGCT 


60 


GT ACCC CTTT 


TTTCAACCAC 


: ACCAGGAGAA 


CCGCACCATG 


GGGCAATCAC 


ACACGCATGA 


120 


LCACCATCAC 


GACGGGTACC 


AGGCACCGCC 


CGAAGACATC 


GCGCTGCGGG 


TCAAGGCCTT 


180 


tataAOTCTCTG 


CTG AT CO AG A 


AAGGTCTTGT 


CGACCCAGCG 


GCCATGGACT 


TGGTCGTCCA 


240 


AACGTATGAA 


CACAAGGTAG 


GCCCCCGAAA 


CGGCGCCAAA 


GTCGTGGCCA 


AGGCCTGGGT 


300 


GGACCCTGCC 

• 


TACAAGGCCC 


GTCTGCTGGC 


AGACGCAACT 


GCGGCAATTG 


CCGAGCTGGG 


360 


CTTCTCCGGG 


GTACAGGGCG 


AGGACATGGT 


CATTCTGGAA 


AACACCCCCG 


CCGTCCACAA 


420 


CGTCTTCGTT 


TGCACCTTGT 


GCTCTTGCTA 


CCCATGGCCG 


ACGCTGGGCT 


TGCCCCCTGC 


480 


CTGGTACAAG 


GCCGCCGCCT 


ACCGGTCCCG 


CATGGTGAGC 


GACCCGCGTG 


GGGTTCTCGC 


540 


GGAGTTCGGC 


CTGGTGATCC 


CCGCCAACAA 


GGAAATCCGC 


GTCTGGGACA 


CCACGGCCGA 


600 


ATTGCGCTAC 


ATGGTGCTGC 


CGGAACGGCC 


CGGAACTGAA 


GCCTACAGCG 


AAGAACAACT 


6 60 


GGCCGAACTC 


GTTACCCGCG 


ATTCGATGAT 


CGGCACCGGC 


CTGCCAACCC 


AACCCACCCC 


720 


ATCTCATTAA 


GGAGTTCGTC 


ATGAATGGCA 


TTCACGATAC 


TGGCGGAGCA 


C ATGGTT AT G 


780 


GGCCGGTTTA 


CAGAGAACCG 


AACGAACCCG 


TCTTTCGCTA 


CGACTGGGAA 


AAAACGGTCA 


840 


TGTCCCTGCT 


CCCGGCCCTG 


CTCGCCAACG 


CGAACTTCAA 


CCTCGATGAA 


TTTCGGCATT 


900 


CGATCGAGCG 


AATGGGCCCG 


GCCCACTATC 


TGGAGGGAAC 


CTACTACGAA 


CACTGGCTTC 


960 


ATGTCTTTGA 


GAACCTGCTG 


GTCGAGAAGG 


GTGTGCTCAC 


GGCCACGGAA 


GTCGCGACCG 


1020 


GCAAGGCTGC 


GTCTGGCAAG 


ACGGCGACGC 


GCGTGCTGAC 


GCCGGCCATC 


GTGGACGACT 


1080 


CGTCAGCACC 


GGGGCTTCTG CGCCCGGGAG 


GAGGGTTCTC 


TTTTTTTCCT 


GTGGGGGACA 


1140 


AGGTTCGCGT 


CCTCAACAAG 


AACCCGGTGG 


GCCATACCCG 


CATGCCGCGC 


TACACGCGGG 


1200 


CAAAGTGGGG 


ACAGTGGTCA 


TCGACCATGG 


TGTGTTTCGT 


GACGCCGGAC 


ACCGCGGCAC 


1260 


ACGGAAAGGG 


CGAGCAGCCC 


CAGCACGTTT 


ACACCGTGAG 


TTTCACGTCG 


GTCGAACTGT 


1320 


GGGGGCAAGA 


CGCTTCCTCG 


CCGAAGGACA 


CGATTCGCGT 


CGACTTGTGG 


GATGACTACC 


1380 


TGGAGCCAGC 


GTGATCATGA 


AAGACGAACG 


GTTTCCATTG 


CCAGAGGGTT 


CGCTGAAGGA 


1440 
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